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Preface 



The papers in this volume were selected for presentation at the 6th Annual 
International Computing and Combinatorics Conference (COCOON 2000), in 
Sydney, Australia from July 26 - 28, 2000. The topics cover many areas in the- 
oretical computer science and combinatorial optimization. 

There were 81 high quality papers submitted to COCOON 2000. Each paper 
was reviewed by at least three program committee members, and the 44 papers 
were selected. It is expected that most of them will appear in a more complete 
form in scientific journals. In addition to the selected papers, the volume also 
contains the papers from two invited keynote speeches by Christos Papadim- 
itriou and Richard Brent. 

This year the Hao Wang Award was given to honor the paper judged by the 
program committee to have the greatest merit. The recipient is “Approximating 
Uniform Triangular Meshes in Polygons” by Franz Aurenhammer, Naoki Katoh, 
Hiromichi Kojima, Makoto Ohsaki, and Yinfeng Xu. The first Best Young Re- 
searcher paper award was given to William Duckworth for his paper “Maximum 
Induced Matchings of Random Cubic Graphs” . 

We wish to thank all who have made this meeting possible: the authors for 
submitting papers, the program committee members and the external referees, 
sponsors, the local organizers, ACM SIGACT for handling electronic submis- 
sions, Springer- Verlag for their support, and Debbie Hatherell for her assistance. 
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Theoretical Problems Related to the Internet 

(Extended Abstract) 



Christos H. Papadimitriou* 

University of California, Berkeley, USA, 
Christ osOcs .berkeley.edu 
http : //www. cs .berkeley.edu/~christos/ 



The Internet has arguably superseded the computer as the most complex co- 
hesive artifact (if you can call it that), and is, quite predictably, the source of 
a new generation of foundational problems for Theoretical Computer Science. 
These new theoretical challenges emanate from several novel aspects of the In- 
ternet: (a) Its unprecedented size, diversity, and availability as an information 
repository; (b) its novel nature as a computer system that intertwines a multi- 
tude of economic interests in varying degrees of competition; and (c) its history 
as a shared resource architecture that emerged in a remarkably ad hoc yet glori- 
ously successful manner. In this talk I will survey some recent research (done in 
collaboration with Joan Feigenbaum, Dick Karp, Elias Koutsoupias, and Scott 
Shenker, see PEI) on problems of the two latter kinds. See m for examples of 
theoretical work along the first axis. 

To a large extent, the Internet owes its remarkable ascent to the surprising 
empirical success of TCP’s congestion control protocol A flow between two 
points strives to determine the correct transmission rate by trying higher and 
higher rates, in a linearly increasing function. As links shared by several flows 
reach their capacity, one or more packets may be lost, and, in response, the 
corresponding flow halves its transmission rate. The fantastic empirical success 
of this crude mechanism raises important, and at times theoretically interesting, 
questions related to novel search algorithms (the process clearly contains a novel 
variant of binary search), on-line algorithms (when the true capacity that a 
flow can use changes in an unknown and, in a worst case, adversarial fashion), 
and algorithmic game theory (why is it that flows seem to be unwilling, or 
unmotivated, to move away from TCP?). 

In a situation in which the same piece of information is multicast to millions 
of individuals, it is of interest to determine the price that each recipient must pay 
in return. The information in question is of different value to each individual, 
and this value is known only to this individual. The participants must engage 
in a mechanism, that is to say, a distributed protocol whereby they determine 
the individuals who will actually receive the information, and the price each will 
pay. Economists over the past fourty years have studied such situations aris- 
ing in auctions, welfare economics, and pricing of public goods, and have come 
up with clever mechanisms — solutions that satisfy various sets of desiderata 

* Research partially supported by the National Science Foundation. 
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(“axioms”) — or ingenious proofs that satisfying other sets of desiderata is im- 
possible. In the context of the Internet, however, an important computational 
desideratum emerges: The protocol should he efficient, with respect to the total 
number of messages sent. It turns out that this novel condition changes the sit- 
uation dramatically, and carves out a new set of pricing mechanisms that are 
appropriate and feasible. 
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Recent Progress and Prospects for Integer 
Factorisation Algorithms 



Richard P. Brent 



Oxford University Computing Laboratory, 
Wolfson Building, Parks Road, 
Oxford 0X1 3QD, UK 
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http : //www. comlab. ox . ac.uk/ 



Abstract. The integer factorisation and discrete logarithm problems 
are of practical importance because of the widespread use of public key 
cryptosystems whose security depends on the presumed difficulty of solv- 
ing these problems. This paper considers primarily the integer factorisa- 
tion problem. In recent years the limits of the best integer factorisation 
algorithms have been extended greatly, due in part to Moore’s law and in 
part to algorithmic improvements. It is now routine to factor 100-decimal 
digit numbers, and feasible to factor numbers of 155 decimal digits (512 
bits). We outline several integer factorisation algorithms, consider their 
suitability for implementation on parallel machines, and give examples 
of their current capabilities. In particular, we consider the problem of 
parallel solution of the large, sparse linear systems which arise with the 
MPQS and NFS methods. 



1 Introduction 

There is no known deterministic or randomised polynomial-timc0 algorithm for 
finding a factor of a given composite integer N. This empirical fact is of great 
interest because the most popular algorithm for public-key cryptography, the 
RSA algorithm El, would be insecure if a fast integer factorisation algorithm 
could be implemented. 

In this paper we survey some of the most successful integer factorisation 
algorithms. Since there are already several excellent surveys emphasising the 
number-theoretic basis of the algorithms, we concentrate on the computational 
aspects. 

This paper can be regarded as an update of H, which was written just 
before the factorisation of the 512-bit number RSA155. Thus, to avoid duplica- 
tion, we refer to d for primality testing, multiple-precision arithmetic, the use 
of factorisation and discrete logarithms in public-key cryptography, and some 
factorisation algorithms of historical interest. 

^ For a polynomial-time algorithm the expected running time should be a polynomial 
in the length of the input, i.e. 0((log A)'’) for some constant c. 
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1.1 Randomised Algorithms 

All but the most trivial algorithms discussed below are randomised algorithms, 
i.e. at some point they involve pseudo-random choices. Thus, the running times 
are expected rather than worst case. Also, in most cases the expected running 
times are conjectured, not rigorously proved. 



1.2 Parallel Algorithms 

When designing parallel algorithms we hope that an algorithm which requires 
time Ti on a computer with one processor can be implemented to run in time 
Tp ~ Ti/P on a computer with P independent processors. This is not always 
the case, since it may be impossible to use all P processors effectively. However, 
it is true for many integer factorisation algorithms, provided P is not too large. 

The speedup of a parallel algorithm is S' = Ti/Tp. We aim for a linear 
speedup, i.e. S = 0{P). 



1.3 Quantum Algorithms 

In 1994 Shor |R7l58j showed that it is possible to factor in polynomial expected 
time on a quantum computer H0EI1- However, despite the best efforts of several 
research groups, such a computer has not yet been built, and it remains unclear 
whether it will ever be feasible to build one. Thus, in this paper we restrict our 
attention to algorithms which run on classical (serial or parallel) computers. The 
reader interested in quantum computers could start by reading EDEOI- 



1.4 Moore’s Law 

Moore’s “law” predicts that circuit densities will double every 18 months 

or so. Of course, Moore’s law is not a theorem, and must eventually fail, but it 
has been surprisingly accurate for many years. As long as Moore’s law continues 
to apply and results in correspondingly more powerful parallel computers, we ex- 
pect to get a steady improvement in the capabilities of factorisation algorithms, 
without any algorithmic improvements. Historically, improvements over the past 
thirty years have been due both to Moore’s law and to the development of new 
algorithms (e.g. ECM, MPQS, NFS). 



2 Integer Factorisation Algorithms 



There are many algorithms for finding a nontrivial factor / of a composite inte- 
ger N. The most useful algorithms fall into one of two classes - 
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A. The run time depends mainly on the size of TV, and is not strongly dependent 
on the size of /. Examples are - 

• Lehman’s algorithm m, which has worst-case run time 0{N^^^). 

• The Continued Fraction algorithm m and the Multiple Polynomial 
Quadratic Sieve (MPQS) algorithm j4tipF)t)] . which under plausible as- 
sumptions have expected run time O (exp( V c In In In iV) ) , where c is a 
constant (depending on details of the algorithm). For MPQS, c « 1. 

• The Number Field Sieve (NFS) algorithm |2hl,'-{( )| . which under plausi- 
ble assumptions has expected run time 0(exp(c(lnfV)^/^(lnln A^)^/^)), 
where c is a constant (depending on details of the algorithm and on the 
form of N). 

B. The run time depends mainly on the size of /, the factor found. (We can 

assume that / < Examples are - 

• The trial division algorithm, which has run time 0{f ■ (log A^)^). 

• Pollard’s “rho” algorithm P5|, which under plausible assumptions has 

expected run time ■ (logAf)^). 

• Lenstra’s Elliptic Curve (ECM) algorithm , which under plausible as- 
sumptions has expected run time 0(exp(\/c In / In In /) • (log Af)^), where 
c « 2 is a constant. 



In these examples, the time bounds are for a sequential machine, and the term 
(log Af)^ is a generous allowance for the cost of performing arithmetic operations 
on numbers which are 0{N'^). If N is very large, then fast integer multiplication 
algorithms can be used to reduce the (log A^)^ term. 

Our survey of integer factorisation algorithms is necessarily cursory. For more 
information the reader is referred to 






3 Lenstra’s Elliptic Curve Algorithm 

Lenstra’s elliptic curve method/algorithm (abbreviated ECM) was discovered 
by H. W. Lenstra, Jr. about 1985 (see pg). It is the best known algorithm 
in class B. To save space, we refer to pro and the references there for a 
description of ECM, and merely give some examples of its successes here. 



3.1 ECM Examples 

1. In 1995 we completed the factorisation of the 309-decimal digit (1025-bit) 
Fermat number Fio = 2^ -|- 1. In fact 

Fio = 45592577 • 6487031809 • 

4659775785220018543264560743076778192897 • P 252 

where 46597 • • • 92897 is a 40-digit prime and P 252 = 13043 • • • 24577 is a 
252-digit prime. The computation, which is described in detail in [7j, took 
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about 240 Mips-years. (A Mips-year is the computation performed by a 
hypothetical machine performing 10® instructions per second for one year, 
i.e. about 3.15 x 10^® instructions. Is is a convenient measure but should not 
be taken too seriously.) 

2. The largest factor known to have been found by ECM is the 54-digit factor 

484061254276878368125726870789180231995964870094916937 

of (6^® — 1)"^^ -I- 1, found by Nik Lygeros and Michel Mizony with Paul 
Zimmermann’s GMP-ECM program inni in December 1999 (for more details 
see 0). 

3.2 Parallel/Distributed Implementation of ECM 

ECM consists of a number of independent pseudo-random trials, each of which 
can be performed on a separate processor. So long as the expected number of 
trials is much larger than the number P of processors available, linear speedup 
is possible by performing P trials in parallel. In fact, if T\ is the expected run 
time on one processor, then the expected run time on a MIMD parallel machine 
with P processors is 

Tp = Ti/P+0{Tl'^"^^) ( 1 ) 



3.3 ECM Factoring Records 




1990 1992 1994 1996 1998 2000 2002 



Figure 1: Size of factor found by ECM versus year 



Recent Progress and Prospects for Integer Factorisation Algorithms 



7 



Figure 1 shows the size D (in decimal digits) of the largest factor found by 
ECM against the year it was done, from 1991 (40D) to 1999 (54D) (historical 
data from 0). 



3.4 Extrapolation of ECM Records 




1990 1992 1994 1996 1998 2000 2002 

Figure 2: \/l) versus year Y for ECM 



Let D be the number of decimal digits in the largest factor found by ECM up 
to a given date. From the theoretical time bound for ECM, assuming Moore’s 
law, we expect ^/Ij to be roughly a linear function of calendar year (in fact 
\/ D hi D should be linear, but given the other uncertainties we have assumed for 
simplicity that Vln D is roughly a constant). Figure 2 shows VD versus year Y . 



The straight line shown in the Figure 2 is 
Y — 1932 3 

\/~D = or equivalently Y = Q.sVd + 1932.3 , 

9.3 



and extrapolation gives = 60 in the year Y = 2004 and £) = 70 in the year 
Y = 2010. 



Richard P. Brent 



4 Quadratic Sieve Algorithms 

Quadratic sieve algorithms belong to a wide class of algorithms which try to find 
two integers x and y such that x ^ ±y (mod N) but 

X? = (mod N) . (2) 

Once such x and y are found, then GCD {x — y, N) is a nontrivial factor of N. 

One way to find x and y satisfying m is to find a set of relations of the form 

uf = v^Wi (mod N), (3) 

where the Wi have all their prime factors in a moderately small set of primes 
(called the factor base). Each relation Q gives a colunm in a matrix R whose 
rows correspond to the primes in the factor base. Once enough columns have been 
generated, we can use Gaussian elimination in GF(2) to find a linear dependency 
(mod 2) between a set of columns of R. Multiplying the corresponding relations 
now gives an expression of the form ©. With probability at least 1/2, we have 
X 7 ^ ±y mod N so a, nontrivial factor of N will be found. If not, we need to 
obtain a different linear dependency and try again. 

In quadratic sieve algorithms the numbers Wi are the values of one (or more) 
quadratic polynomials with integer coefficients. This makes it easy to factor the 
Wi by sieving. For details of the process, we refer to Illl32tl5l46l49l52[5il . 

The best quadratic sieve algorithm (MPQS) can, under plausible assump- 
tions, factor a number N in time 0(exp(c(ln A^lnln where c ^ 1. The 

constants involved are such that MPQS is usually faster than EGM if N is the 
product of two primes which both exceed This is because the inner loop 

of MPQS involves only single-precision operations. 

Use of “partial relations”, i.e. incompletely factored Wi, in MPQS gives a 
significant performance improvement j2|. In the “one large prime” (P-MPQS) 
variation Wi is allowed to have one prime factor exceeding m (but not too much 
larger than m). In the “two large prime” (PP-MPQS) variation Wi can have two 
prime factors exceeding m - this gives a further performance improvement at 
the expense of higher storage requirements M- 



4.1 Parallel/Distributed Implementation of MPQS 

The sieving stage of MPQS is ideally suited to parallel implementation. Different 
processors may use different polynomials, or sieve over different intervals with 
the same polynomial. Thus, there is a linear speedup so long as the number of 
processors is not much larger than the size of the factor base. The computa- 
tion requires very little communication between processors. Each processor can 
generate relations and forward them to some central collection point. This was 
demonstrated by A. K. Lenstra and M. S. Manasse 1221 , who distributed their 
program and collected relations via electronic mail. The processors were scat- 
tered around the world - anyone with access to electronic mail and a G compiler 
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could volunteer to contribut^. The final stage of MPQS - Gaussian elimination 
to combine the relations - is not so easily distributed. We discuss this in 
below. 



4.2 MPQS Examples 






MPQS has been used to obtain many impressive factorisations 
At the time of writing (April 2000), the largest number factored by MPQS is 
the 129-digit “RSA Challenge” jS3] number RSA129. It was factored in 1994 
by Atkins et aZ The relations formed a sparse matrix with 569466 columns, 
which was reduced to a dense matrix with 188614 columns; a dependency was 
then found on a MasPar MP-1. It is certainly feasible to factor larger numbers 
by MPQS, but for numbers of more than about 110 decimal digits GNFS is 
faster (21 • For example, it is estimated in that to factor RSA129 by MPQS 
required 5000 Mips-years, but to factor the slightly larger number RSA130 by 
GNFS required only 1000 Mips-years |T^ . 



5 The Special Number Field Sieve (SNFS) 

The number field sieve (NFS) algorithm was developed from the special number 
field sieve (SNFS), which we describe in this section. The general number field 
sieve (GNFS or simply NFS) is described in (0 

Most of our numerical examples have involved numbers of the form 

± 6 , (4) 

for small a and b, although the ECM and MPQS factorisation algorithms do not 
take advantage of this special form. 

The special number field sieve (SNFS) is a relatively new algorithm which 
does take advantage of the special form ( 0 . In concept it is similar to the 
quadratic sieve algorithm, but it works over an algebraic number field defined 
by o, e and b. We refer the interested reader to Lenstra et al I29I30I for details, 
and merely give an example to show the power of the algorithm. 



5.1 SNFS Examples 

Consider the 155-decimal digit number 

Fg = fV = 22" -h 1 

as a candidate for factoring by SNFS. Note that 8N = mfi + 8, where m = 2^°^. 
We may work in the number field Qipi), where a satisfies 

-k8 = 0, 

2 This idea of using machines on the Internet as a “free” supercomputer has been 
adopted by several other computation-intensive projects 
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and in the ring of integers of Q{a). Because 

+ 8 = 0 (mod N), 

the mapping (p : a m mod is a ring homomorphism from Z[a] to Z/NZ . 

The idea is to search for pairs of small coprime integers u and v such that both 
the algebraic integer u + av and the (rational) integer u + mv can be factored. 
(The factor base now includes prime ideals and units as well as rational primes.) 
Because 

4>{u + av) = (m + mv) (mod N), 

each such pair gives a relation analogous to (0. 

The prime ideal factorisation oiu+av can be obtained from the factorisation 
of the norm u^—8v^ of u+av. Thus, we have to factor simultaneously two integers 
u + mv and — 8u®|. Note that, for moderate u and v, both these integers are 
much smaller than N, in fact they are 0{N^^‘^), where d = 5 is the degree of 
the algebraic number field. (The optimal choice of d is discussed in ®) 

Using these and related ideas, Lenstra et al inn factored Fg in June 1990, 
obtaining 

Fg = 2424833- 7455602825647884208337395736200454918783366342657 -pgg, 

where pgg is an 99-digit prime, and the 7-digit factor was already known 
(although SNFS was unable to take advantage of this). The collection of re- 
lations took less than two months on a network of several hundred workstations. 
A sparse system of about 200,000 relations was reduced to a dense matrix with 
about 72,000 rows. Using Gaussian elimination, dependencies (mod 2) between 
the columns were found in three hours on a Connection Machine. These depen- 
dencies implied equations of the form mod Fg. The second such equation 

was nontrivial and gave the desired factorisation of Fg. 

More recently, considerably larger numbers have been factored by SNFS, 
for example, the 211-digit number 10^^^ — 1 was factored early in 1999 by a 
collaboration called “The Cabal” ISI- 

6 The General Number Field Sieve (GNFS) 

The general number field sieve (GNFS or just NFS) is a logical extension of the 
special number field sieve (SNFS). When using SNFS to factor an integer N , we 
require two polynomials f{x) and g{x) with a common root m mod N but no 
common root over the field of complex numbers. If N has the special form m 
then it is usually easy to write down suitable polynomials with small coefficients, 
as illustrated by the two examples given in 0 If has no special form, but is 
just some given composite number, we can also find f{x) and g{x), but they no 
longer have small coefficients. 

Suppose that g{x) has degree d > 1 and f{x) is lineai0. d is chosen empir- 
ically, but it is known from theoretical considerations that the optimum value 

® This is not necessary. For example, Montgomery found a clever way of choosing two 
quadratic polynomials. 
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is 

^ In In ivy 

We choose m = and write 



d 

N = ^ ajm^ 
j=o 

where the aj are “base m digits” . Then, defining 

d 

f{x) =x-m, g{x) = ^ QjX^ , 
J=o 



it is clear that f{x) and g{x) have a common root m mod N. This method of 
polynomial selection is called the “base m” method. 

In principle, we can proceed as in SNFS, but many difficulties arise because of 
the large coefficients of g{x). For details, we refer the reader to 1 f47|lisptil^ . 

Suffice it to say that the difficulties can be overcome and the method works! Due 
to the constant factors involved it is slower than MPQS for numbers of less than 
about 110 decimal digits, but faster than MPQS for sufficiently large numbers, 
as anticipated from the theoretical run times given in ^ 

Some of the difficulties which had to be overcome to turn GNFS into a 
practical algorithm are: 



1. Polynomial selection. The “base m” method is not very good. Peter Mont- 
gomery and Brian Murphy 1401411421 have shown how a very considerable 
improvement (by a factor of more than ten for number of 140-155 digits) 
can be obtained. 

2. Linear algebra. After sieving a very large, sparse linear system over GF(2) 
is obtained, and we want to find dependencies amongst the columns. It is 
not practical to do this by structured Gaussian elimination 123 §5] because 
the “fill in” is too large. Odlyzko j4;-iH Yj and Montgomery I2Z] showed that 
the Lanczos method m could be adapted for this purpose. (This is non- 
trivial because a nonzero vector x over GF(2) can be orthogonal to itself, 
i.e. x"^x = 0.) To take advantage of bit-parallel operations, Montgomery’s 
program works with blocks of size dependent on the wordlength (e.g. 64). 

3. Square roots. The final stage of GNFS involves finding the square root of 
a (very large) product of algebraic number^. Once again, Montgomery m 
found a way to do this. 



An idea of Adleman, using quadratic characters, is essential to ensure that the desired 
square root exists with high probability. 
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6.1 RSA155 

At the time of writing, the largest number factored by GNFS is the 155-digit 
RSA Challenge number RSA155. It was split into the product of two 78-digit 
primes on 22 August, 1999, by a team coordinated from CWI, Amsterdam. For 
details see inn. To summarise: the amount of computer time required to find the 
factors was about 8000 Mips-years. The two polynomials used were 

f{x) = x- 39123079721168000771313449081 



and 



g{x) = -K19377138320a;'^ 

-80168937284997582a;^ 

-662698522341 18574445a;3 

-hi 1816848430079521880356852a;2 

-h7459661580071786443919743056a: 

-40679843542362159361913708405064. 



The polynomial g{x) was chosen to have a good combination of two proper- 
ties: being unusually small over the sieving region and having unusually many 
roots modulo small primes (and prime powers). The effect of the second prop- 
erty alone makes g{x) as effective at generating relations as a polynomial chosen 
at random for an integer of 137 decimal digits (so in effect we have removed 
at least 18 digits from RSA155 by judicious polynomial selection). The polyno- 
mial selection took approximately 100 Mips-years or about 1.25% of the total 
factorisation time. For details of the polynomial selection, see Brian Murphy’s 
thesis mi. 

The total amount of CPU time spent on sieving was 8000 Mips-years on 
assorted machines (calendar time 3.7 months). The resulting matrix had about 
6.7 X 10® rows and weight (number of nonzeros) about 4.2 x 10® (about 62 
nonzeros per row). Using Montgomery’s block Lanczos program, it took almost 
224 CPU-hours and 2 CB of memory on a Cray C916 to find 64 dependencies. 
Calendar time for this was 9.5 days. 



Table 1. RSA130, RSA140 and RSA155 factorisations 





RSA130 


RSA140 


RSA155 


Total CPU time in Mips-years 
Matrix rows 
Total nonzeros 
Nonzeros per row 

Matrix solution time (on Cray C90/C916) 


1000 

3.5 X 10® 
1.4 X 10® 
39 

68 hours 


2000 
4.7 X 10® 
1.5 X 10® 
32 

100 hours 


8000 
6.7 X 10® 
4.2 X 10® 
62 

224 hours 
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Table ni gives some statistics on the RSA130, RSA140 and RSA155 factori- 
sations. The Mips-year times given are only rough estimates. Extrapolation of 
these figures is risky, since various algorithmic improvements were incorporated 
in the later factorisations and the scope for further improvements is unclear. 



7 Parallel Linear Algebra 

At present, the main obstacle to a fully parallel and scalable implementation 
of GNFS (and, to a lesser extent, MPQS) is the linear algebra. Montgomery’s 
block Lanczos program runs on a single processor and requires enough memory 
to store the sparse matrix (about five bytes per nonzero element). It is possible 
to distribute the block Lanczos solution over several processors of a parallel 
machine, but the communication/computation ratio is high j,3?Sj . 

Similar remarks apply to some of the best algorithms for the integer discrete 
logarithm problem. The main difference is that the linear algebra has to be 
performed over a (possibly large) finite field rather than over GF(2). 

In this Section we present some preliminary ideas on how to implement the 
linear algebra phase of GNFS (and MPQS) in parallel on a network of relatively 
small machines. This work is still in progress and it is too early to give definitive 
results. 



7.1 Assumptions 

Suppose that a collection of P machines is available. It will be convenient to 
assume that P = is a, perfect square. For example, the machines might be 
400 Mhz PGs with 256 MByte of memory each, and P = 16 or P = 64. 

We assume that the machines are connected on a reasonably high-speed 
network. For example, this might be multiple lOOMb/sec Ethernets or some 
higher-speed proprietary network. Although not essential, it is desirable for the 
physical network topology to be a g x 5 grid or torus. Thus, we assume that 
a processor P^j in row i can communicate with another processor Pij> in the 
same row or broadcast to all processors Pi_* in the same row without interference 
from communication being performed in other rows. (Similarly for columns.) 
The reason why this topology is desirable is that it matches the communication 
patterns necessary in the linear algebra: see for example ra. 

To simplify the description, we assume that the standard Lanczos algorithm 
is used. In practice, a block version of Lanczos |2Z| would be used, both to take 
advantage of word-length Boolean operations and to overcome the technical 
problem that u^u can vanish for u ^ 0 when we are working over a finite field. 
The overall communication costs are nearly the same for blocked and unblocked 
Lanczos, although blocking reduces startup costs. 

If R is the matrix of relations, with one relation per column, then P is a large, 
sparse matrix over GF(2) with slightly more columns than rows. (Warning: in 
the literature, the transposed matrix is often considered, i.e. rows and columns 
are often interchanged.) We aim to find a nontrivial dependency^ i.e. a nonzero 
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vector X such that Rx = 0. If we choose b = Ry for some random y, and solve 
Rx = b, then x — y is a, dependency (possibly trivial, if x = y). Thus, in the 
description below we consider solving a linear system Rx = b. 

The Lanczos algorithm over the real field works for a positive-definite sym- 
metric matrix. In our case, R is not symmetric (or even square). Hence we apply 
Lanczos to the normal equations 



R^Rx = R^b 



rather than directly to Rx = b. We do not need to compute the matrix A = RF R, 
because we can compute Ax as R7'{Rx). To compute i?^z, it is best to think 
of computing {z^R)'^ because the elements of R will be scattered across the 
processors and we do not want to double the storage requirements by storing 
R"^ as well as R. 

The details of the Lanczos iteration are not important for the description 
below. It is sufficient to know that the Lanczos recurrence has the form 

w^+l = Awi - 

where the multipliers Ci^i and Ci^i-i can be computed using inner products of 
known vectors. Thus, the main work involved in one iteration is the computation 
of Awi = R'^{Rwi). In general, if R has n columns, then Wi is a dense n- vector. 
The Lanczos process terminates with a solution after n + 0{1) iterations [[17] . 

7.2 Sparse Matrix- Vector Multiplication 

The matrix of relations R is sparse and almost square. Suppose that R has n 
columns, m ~ n rows, and pn nonzeros, i.e. p nonzeros per column. Consider 
the computation of Rw, where w is a dense n-vector. The number of multiplica- 
tions involved in the computation of Rw is pn. (Thus, the number of operations 
involved in the complete Lanczos process is of order pn^ <§; .) 

For simplicity assume that q = -\/P divides n, and the matrix R is distributed 
across the q x q array of processors using a “scattered” data distribution to 
balance the load and storage requirements 0. Within each processor Pij a 
sparse matrix representation will be used to store the relevant part of R. 
Thus the local storage of requirement is 0{pn/q^ + n/q). 

After each processor computes its local matrix-vector product with about 
pn/q^ multiplications, it is necessary to sum the partial results (n/g- vectors) in 
each row to obtain n/q components of the product Rw. This can be done in 
several ways, e.g. by a systolic computation, by using a binary tree, or by each 
processor broadcasting its n/g- vector to other processors in its row and summing 
the vectors received from them to obtain n/q components of the product Rw. 
If broadcasting is used then the results are in place to start the computation of 
R'^{Rw). If an inner product is required then communication along columns of 
the processor array is necessary to transpose the dense n-vector (each diagonal 
processor Pi^i broadcasts its n/q elements along its column). 
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We see that, overall, each Lanczos iteration requires computation 
0{pn/q^ + n) on each processor, and communication of 0(n) bits along each 
row/column of the processor array. Depending on details of the communication 
hardware, the time for communication might be 0{{n/q) log q) (if the binary tree 
method is used) or even 0{n/q + q) (if a systolic method is used with pipelining). 
In the following discussion we assume that the communication time is propor- 
tional to the number of bits communicated along each row or column, i.e. 0(n) 
per iteration (not the total number of bits communicated per iteration, which is 
0{nq)). 

The overall time for n iterations is 



Tp « apn^ /P + , 



( 5 ) 



where the constants a and (3 depend on the computation and communication 
speeds respectively. (We have omitted the communication “startup” cost since 
this is typically much less than f3n3.) 

Since the time on a single processor is Ti « apn? , the efficiency Ep is 



Ep 



Ti 

PTp 



ap 

ap + (3P 



and the condition for Ep > 1/2 is 



ap > (3P . 

Typically (3 /a is large (50 to several hundred) because communication be- 
tween processors is much slower than computation on a processor. The number 
of nonzeros per column, p, is typically in the range 30 to 100. Thus, we can not 
expect high efficiency. 

Fortunately, high efficiency is not crucial. The essential requirements are 
that the (distributed) matrix R fits in the local memories of the processors, and 
that the time for the linear algebra does not dominate the overall factorisation 
time. Because sieving typically takes much longer than linear algebra on a single 
processor, we have considerable leeway. 

For example, consider the factorisation of RSA155 (see Table [U above). We 
have n « 6.7 x 10®, p « 62. Thus ~ 4.5 x 10^®. If the communications speed 
along each row/column is a few hundred Mb/sec then we expect /3 to be a small 
multiple of 10“® seconds. Thus /3n^ might be (very roughly) 10® seconds or say 12 
days. The sieving takes 8000 Mips-years or 20 processor-years if each processor 
runs at 400 Mips. With P processors this is 20/P years, to be compared with 
12/365 years for communication. Thus, 

sieving time 600 

communication time P 

and the sieving time dominates unless P > 600. For larger problems we expect 
the sieving time to grow at least as fast as n^. 
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For more typical factorisations performed by MPQS rather than GNFS, the 
situation is better. For example, using Arjen Lenstra’s implementation of PP- 
MPQS in the Magma package, we factored a 102-decimal digit composite C 102 (a 
factor of 14^^^ — 1) on a 4 x 250 Mhz Sun multiprocessor. The sieving time was 
11.5 days or about 32 Mips- years. After combining partial relations the matrix R 
had about n = 73000 columns. Thus « 5.3 x 10®. At 100 Mb/sec it takes less 
than one minute to communicate n® bits. Thus, although the current Magma 
implementation performs the block Lanczos method on only one processor, there 
is no reason why it could not be performed on a network of small machines (e.g. a 
“Beowulf” cluster) and the communication time on such a network would only 
be a few minutes. 



7.3 Reducing Communication Time 

If our assumption of a grid/torus topology is not satisfied, then the communi- 
cation times estimated above may have to be multiplied by a factor of order 
q = y/P. In order to reduce the communication time (without buying better 
hardware) it may be worthwhile to perform some steps of Gaussian elimination 
before switching to Lanczos. 

If there are k relations whose largest prime is p, we can perform Gaussian 
elimination to eliminate this large prime and one of the k relations. (The work 
can be done on a single processor if the relations are hashed and stored on pro- 
cessors according to the largest prime occurring in them.) Before the elimination 
we expect each of the k relations to contain about p nonzeros; after the elim- 
ination each of the remaining fc — 1 relations will have about p — 1 additional 
nonzeros (assuming no “cancellation”). 

From o, the time for Lanczos is about 

T = n{az/P + (3n) , 

where z = pn is the weight (number of nonzeros) of the matrix R. The effect of 
eliminating one relation is n ^ n' = n — 1, z ^ z' = z + {p — l){k — 1) — p, 
T ^ T' = n'{az'jP + j3n') . After some algebra we see that T' < T if 

, 2/3P „ 

k < -t- 3 . 

ap 

For typical values of fi/a « 100 and p « 50 this gives the condition k < AP + 3. 

The disadvantage of this strategy is that the total weight z of the matrix 
(and hence storage requirements) increases. In fact, if we go too far, the matrix 
becomes full (see the RSA129 example in 

As an example, consider the C 102 factorisation mentioned at the end of ^ 17.21 
The initial matrix R had n 73000; we were able to reduce this to n « 32000, 
and thus reduce the communication requirements by 81%, at the expense of 
increasing the weight by 76%. Our (limited) experience indicates that this be- 
haviour is typical - we can usually reduce the communication requirements by a 
factor of five at the expense of at most doubling the weight. However, if we try 
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to go much further, there is an “explosion”, the matrix becomes dense, and the 
advantage of the Lanczos method over Gaussian elimination is lost. (It would be 
nice to have a theoretical explanation of this behaviour.) 

8 Historical Factoring Records 




I960 1970 1980 1990 2000 



Figure 3: Size of “general” number factored versus year 



2010 



Figure 3 shows the size D (in decimal digits) of the largest “general” number 
factored against the year it was done, from 1964 (20D) to April 2000 (155D) 
(historical data from [41 ildlbbj l. 



8.1 Curve Fitting and Extrapolation 

Let D be the number of decimal digits in the largest “general” number fac- 
tored by a given date. From the theoretical time bound for GNFS, assuming 
Moore’s law, we expect to be roughly a linear function of calendar year (in 
fact L)1/3(1 n£))2/3 should be linear, but given the other uncertainties we have 
assumed for simplicity that (In is roughly a constant). Figure 4 shows 

versus year Y . 

The straight line shown in the Figure 4 is 
1Q28 6 

^ or equivalently Y = 13.24:0^^^ + 1928.6 , 

13.24 

and extrapolation, for what it is worth, gives D = 309 (i.e. 1024 bits) in the year 
Y = 2018. 
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Figure 4: versus year Y 



9 Summary and Conclusions 

We have sketched some algorithms for integer factorisation. The most important 
are ECM, MPQS and NFS. As well as their inherent interest and applicability 
to other areas of mathematics, advances in public key cryptography have lent 
them practical importance. 

Despite much progress in the development of efficient algorithms, our knowl- 
edge of the complexity of factorisation is inadequate. We would like to find a 
polynomial time factorisation algorithm or else prove that one does not exist. 
Until a polynomial time algorithm is found or a quantum computer capable 
of running Shor’s algorithm is built, large factorisations will remain an 

interesting challenge. 

A survey similar to this one was written in 1990 (see P]). Comparing the 
examples there we see that significant progress has been made. This is partly 
due to Moore’s law, partly due to the use of many machines on the Internet, 
and partly due to improvements in algorithms (especially GNFS). The largest 
number factored by MPQS at the time of writing |2| had 111 decimal digits. 
According to RSAllO was factored in 1992 (by MPQS). In 1996 GNFS was 
used to factor RSA130; in August 1999, RSA155 also fell to GNFS. Progress 
seems to be accelerating. This is due in large part to algorithmic improvements 
which seem unlikely to be repeated. On the other hand, it is very hard to antic- 
ipate algorithmic improvements! 



Recent Progress and Prospects for Integer Factorisation Algorithms 



19 



From the predicted run time for GNFS, we would expect RSA155 to take 
6.5 times as long as RSA14C0. On the other hand, Moore’s law predicts that 
circuit densities will double every 18 months or so. Thus, as long as Moore’s law 
continues to apply and results in correspondingly more powerful parallel com- 
puters, we expect to get three to four decimal digits per year improvement in the 
capabilities of GNFS, without any algorithmic improvements. The extrapolation 
from historical figures is more optimistic: it predicts 6-7 decimal digits per year 
in the near future. 

Similar arguments apply to EGM, for which we expect slightly more than 
one decimal digit per year in the size of factor found j0| . 

Although we did not discuss them here, the best algorithms for the inte- 
ger discrete logarithm problem are analogous to NFS, requiring the solution of 
large, sparse linear systems over finite fields, and similar remarks apply to such 
algorithms. 

Regarding cryptographic consequences, we can say that 512-bit RSA keys 
are already insecure. 1024-bit RSA keys should remain secure for at least fifteen 
years, barring the unexpected (but unpredictable) discovery of a completely 
new algorithm which is better than GNFS, or the development of a practical 
quantum computer. In the long term, public key algorithms based on the discrete 
logarithm problem for elliptic curves appear to be a promising alternative to RSA 
or algorithms based on the integer discrete logarithm problem. 
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1 Introduction 

Given a convex polygon P in the plane and a positive integer n, we consider 
the problem of generating a length-uniform triangular mesh for the interior of 
P using n Steiner points. More specifically, we want to find both a set Sn of n 
points inside P, and a triangulation of P using Sn, with respect to the following 
minimization criteria: (1) ratio of the maximum edge length to the minimum 
one, (2) maximum edge length, and (3) maximum triangle perimeter. 

These problems can be formalized as follows: Let V be the set of vertices of 
P. For an n-point set Sn interior to P let T{Sn) denote the set of all possible 
triangulations of Sn U V. Further, let l{e) denote the (Euclidean) length of edge 
e, and let peri{A) be the perimeter of triangle A. 

Problem 1 mins^cP minTGT(s„) maxejgT ^ 

Problem 2 mins^i^P minj’gg-( 5 ^) maxegy Z(e) 

Problem 3 mins^gP minj’g 7 -( 5 ^) max/igp perf(A) 

Finding an optimal solution for any of the three problems seems to be diffi- 
cult, in view of the NP-completeness of packing problems in the plane, see e.g. 
Johnson III3!. or in view of the intrinsic complexity of Heilbronn’s triangle prob- 
lem, see m- For the case of a fixed point set, minimizing the maximum edge 
length is known to be solvable in 0{v?) time; see Edelsbrunner and Tan [Zj. 
Nooshin et al. Pg developed a potential-based heuristic method for Problem 2, 
but did not give a theoretical guarantee for the obtained solution. 

In this paper, we offer an 0{n^ log n) heuristic capable of producing con- 
stant approximations for any of the three problems stated above. Respective 
approximation factors of 6, 4\/3, and 6-\/3 are proven, provided n is reasonably 
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large. Our experiments reveal a much better behaviour, concerning the quality 
as well as the runtime. With minor modifications, our method works for arbi- 
trary polygons (with possible holes), and yields the same approximation result 
for Problem 1. Concerning Problems 2 and 3, the approximation factors above 
can be guaranteed for a restricted class of non-convex polygons. 

We first develop a heuristic we called canonical Voronoi insertion which 
approximately solves a certain extreme packing problem for point sets within P. 
The method is similar to the one used in Gonzalez jO] and Feder and Greene jS| 
developed for clustering problems. We then show how to modify the heuristic, 
in order to produce a set of n points whose Delaunay triangulation within P 
constitutes a constant approximation for the problems stated above. Note that 
the solution we construct is neccessarily a triangulation of constant vertex degree. 

Generating triangular meshes is one of the fundamental problems in compu- 
tational geometry, and has been extensively studied; see e.g. the survey article 
by Bern and Eppstein |3|. Main fields of applications are finite element meth- 
ods and computer aided design. In finite element methods, for example, it is 
desirable to generate triangulations that do not have too large or too small an- 
gles. Along this direction, various algorithms have been reported |4I1 lltil2l,^ll,4[ . 
Restricting angles means bounding the edge length ratio for the individual trian- 
gles, but not necessarily for a triangulation in global, which might be desirable 
in some applications. That is, the produced triangulation need not be uniform 
concerning, e.g., the edge length ratio of its triangles. Ghew ^ and Melisseratos 
and Souvaine m construct uniform triangular meshes in the weaker sense that 
only upper bounds on the triangle size are required. To the knowledge of the 
authors, the problems dealt with in this paper have not been studied in the field 
of computational geometry. The mesh refinement algorithms in Ghew 0 and in 
Ruppert uni are similar in spirit to our Voronoi insertion method, but do not 
proceed in a canonical way and aim at different optimality criteria. 

A particular application of length-uniform triangulation arises in designing 
structures such as plane trusses with triangular units, where it is required to de- 
termine the shape from aesthetic points of view under the constraints concerning 
stress and nodal displacement. The plane truss can be viewed as a triangulation 
of points in the plane by regarding truss members and nodes as edges and points, 
respectively. When focusing on the shape, edge lengths should be as equal as pos- 
sible from the viewpoint of design, mechanics and manufacturing; see fran) . In 
such applications, the locations of the points are usually not fixed, but can be 
viewed as decision variables. In view of this application field, it is quite natural 
to consider Problems 1, 2, and 3. 

The following notation will be used throughout. For two points x and y in 
the plane, let l{x,y) denote their Euclidean distance. The minimum (non-zero) 
distance between two point sets X and Y is defined as 1{X,Y) = min{Z(a;,?/) | 
X G X,y gY,x ^ y}. When V is a singleton set {a;} we simply write 1{X, Y) as 
l{x,Y). Note that 1{X,X) defines the minimum interpoint distance among the 
point set X. 
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2 Canonical Voronoi Insertion and Extreme Packing 

In this section, we consider the following extreme packing problem. Let P be a 
(closed) convex polygon with vertex set V. 

Maximize 1{V U Sn, V U Sn) 
subject to a set Sn of n points within P. 

We shall give a 2-approximation algorithm for this problem using canonical 
Voronoi insertion. In Section 3 we then show that the point set Sn produced by 
this algorithm, as well as the Delaunay triangulation induced by S„ within P, 
can be modified to give an approximate solution for the three problems addressed 
in Section 1. 

The algorithm determines the location of the point set Sn in a greedy manner. 
Namely, starting with an empty set S, it repeatedly places a new point inside P 
at the position which is farthest from the set U U S'. The idea of the algorithm 
originates with Gonzalez and Feder and Greene 0, and was developed for 
approximating minimax fc-clusterings. Gomparable insertion strategies are also 
used for mesh generation in Ghew 0 and in Ruppert m, there called Delaunay 
refinement. Their strategies aim at different quality measures, however, and 
insertion does not take place in a canonical manner. 

The algorithm uses the Voronoi diagram of the current point set to select 
the next point to be inserted. We assume familiarity with the basic properties 
of a Voronoi diagram and its dual, the Delaunay triangulation, and refer to the 
survey paper p. 

Algorithm INSERT 
Step 1 : Initialize S' := 0. 

Step 2: Gompute the Voronoi diagram Vor(V U S) of V U S. 

Step 3: Find the set B of intersection points between edges of Vor(V U S) and 
the boundary of P. Among the points in B and the vertices of Vor( V U S) inside 
P, choose the point u which maximizes l{u,V U S). 

Step 4: Put S := S U {u} and return to Step 2 if |S| < n. 

Let pj and Sj, respectively, denote the point chosen in Step 3 and the set 
obtained in Step 4 at the j-th iteration of the algorithm. For an arbitrary point 
X G P define the weight of x with respect to Sj as Wj{x) = l{x, Sj U V). That is, 
Wj (x) is the radius of the largest circle centered at x which does not enclose any 
point from Sj UV . By definition of a Voronoi diagram, the point pj maximizes 
Wj-i{x) over all x G P. Let 

dn = l{SnUV,SnUV) ( 1 ) 

be the minimum interpoint distance realized by Sn U V . Furthermore, denote by 
Sf the optimal solution for the extreme packing problem for P and let d* denote 
the corresponding objective value. The following approximation result might be 
of interest in its own right. Its proof is an adaptation of techniques in and 
contains observations that will be used in our further analysis. 
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Theorem 1. The solution Sn obtained by Algorithm INSERT is a 2-approxi- 
mation of the extreme packing problem for P . That is, d„ > d'^j2. 

Proof. We claim that p„ realizes the minimum (non-zero) distance from Sn to 
Sn^V. Equivalently, the claim is 

Wn-l(Pn) =KSn,Sn'JV). (2) 

To see this, assume that the minimum distance is realized by points pk and 
Pj different from Let pk be inserted after pj by the algorithm. Then we 
get Wk-i{pk) < l{pk,Pj) < l{pn,Sn-i U E) = Wn-iipn)- On the other hand, 
the sequence of weights chosen by the algorithm must be non-increasing. More 
exactly, Wk-i{pk) > Wk-i{Pn) > Wn-i{Pn)- This is a contradiction. 

The observations = min{^(S'„, S'„ U V),l{V,V)} and 1{V,V) > d„ > d„ 
now imply dn = min{?ii„_i(p„), d* } by (|2I). But pn maximizes Wn-i(x) for all 
X G P. So the lemma below (whose proof is omitted) completes the argument. 



Lemma 1. For any set S G P of n — 1 points there exists a point x G P with 
l{x,SUV) > d*n/2. 

3 Delaunay Triangulation of Bounded Edge Ratio 

Our aim is to show that Algorithm INSERT is capable of producing a point 
set appropriate for Problems 1, 2, and 3. To this end, we first investigate the 
Delaunay triangulation DT(S'„ U E) of S'„ U P. This triangulation is implicitly 
constructed by the algorithm, as being the dual structure of Vor(S'„ U V). How- 
ever, DT(S'„ U V) need not exhibit good edge length properties. We therefore 
prescribe the placement of the first k inserted points, and show that Algorithm 
INSERT completes them to a set of n points whose Delaunay triangulation has 
its edge lengths controlled by the minimum interpoint distance dn for 5'„ U E. 

3.1 Triangle types. For 1 < j < n, consider the triangulation DT(S'j U E). 
Let us classify a triangles A of DT(iS'j U E) as either critical or non-critical, 
depending on whether the Voronoi vertex dual to A (i.e., the circumcenter of 
A) lies outside of the polygon P or not. Whereas edges of critical triangles can 
be arbitrarily long, edge lengths are bounded in non-critical triangles. 

Lemma 2. No edge e of a non-critical triangle A of DT(Sj UV) is longer than 

2-Wj-i{pj). 

Proof. Let e = (p, q) and denote with x the Voronoi vertex dual to Z\. As a; 
lies inside of P, we get l{x,p) = l(x,q) = Wj-\{x) < Wj-i{pj), by the choice of 
point Pj in Step 3 of Algorithm INSERT. The triangle inequality now implies 
l{p,q) < 2 • Wj-i{pj). 
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We make an observation on critical triangles. Consider some edge e of DT(S'jUy) 
on the boundary of P. Edge e cuts off some part of the diagram Voi {Sj U V) 
that is outside of P. If that part contains Voronoi vertices then we define the 
critical region, R{e), for e as the union of all the (critical) triangles that are dual 
to these vertices. Notice that each critical triangle of DT(S'^- U V) belongs to a 
unique critical region. 

Lemma 3. No edge f of a critical triangle in R{e) is longer than /(e). 

Proof. Let p be an endpoint of /. Then the region of p in Yor{Sj U V) intersects 
e. Let a; be a point in this region but outside of P. There is a circle around x 
that encloses p but does not enclose any endpoint of e. Within P, this circle is 
completely covered by the circle C with diameter e. This implies that p lies in C. 
As the distance between any two points in C is at most l{e), we get /(/) < /(e). 

Let us further distinguish between interior triangles and non-interior ones, the 
former type having no two endpoints on the boundary of P. The shortest edge 
of an interior triangle can be bounded as follows. 

Lemma 4. Each edge e of an interior triangle A of DT(Sj U V ) has a length 
of at least Wj-i{pj). 

Proof. We have /(e) > l{Sj,Sj U V), because A has no two endpoints on P’s 
boundary. But from © we know l{Sj, Sj U U) = Wj-\{pj). 

3.2 Edge length bounds. We are now ready to show how a triangulation 
with edge lengths related to c/„ can be computed. First, Algorithm INSERT is run 
on P, in order to compute the value c/„. We assume than n is chosen sufficiently 
large to assure c/„ < l{V,V)/2. This assumption is not unnatural as the shortest 
edge of the desired triangulation cannot be longer than the shortest edge of P. 
After having c/„ available, k points p[, ... ,p'f. are placed on the boundary of P, 
with consecutive distances between 2 • and 3 ■ dn, and such that 1{V' ,V) > 
dn holds, for V’ = V U {p’l, . . . , p'ff\ . Notice that such a placement is always 
possible. Finally, n — k additional points . . . ,Pn are produced by re-running 
Algorithm INSERT after this placement. 

For 1 < j < n, let Sj = {p'^, . . . ,p' }. Define w{x) = l{x, S'^ U V) for a point 
X G P. The value of w{p'^) will turn out to be crucial for analyzing the edge 
length behavior of the triangulation DT(S'(j U U). The lemma below asserts that 
w{p'^) is small if n exceeds twice the number k of prescribed points. 

Lemma 5. Suppose n > 2k. Then w{p'„) < 3 ■ dn. 

Proof. The point set Sn produced by Algorithm INSERT in the first run is large 
enough to ensure < l(V,V). So we get dn = Wn-i{Pn) from O. As point Pn 
maximizes Wn-i(x) for all x G P, the n -|- |U| circles centered at the points in 
S'n U U and with radii dn completely cover the polygon P. Let c/„ = 1 for the 
moment. Then 



A(P) < TT{n+\V\)-A' 



( 3 ) 
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where A{P) is the area of P, and A' denotes the area outside of P which is 
covered by the circles centered at V. 

Assume now w{p'^) > 3 • Draw a circle with radius around each point 
in \ 5' . Since w{p'J = 1{S'^ \ S'^, S'„ U F) by these circles are pairwise 
disjoint. By the same reason, and because boundary distances defined by V = 
V US'f. are at most 3-d„, these circles all lie completely inside P. Obviously, these 
circles are also disjoint from the \V\ circles of radius centered at V. Finally, 
the latter circles are pairwise disjoint, since < l{V,V)j2. Consequently, 

A{P) > ^Tr{n -k) + A" (4) 

where A" denotes the area inside of P which is covered by the circles centered at 
V. Combining and and observing A' + A!' = tt ■\V\ now implies n < 2k, 
a contradiction. 



It has to be observed that the number k depends on n. The following fact guar- 
antees the assumption in Lemma 0 provided n is sufficiently large. Let B{P) 
denote the perimeter of P. 



Lemma 6. The condition dn < A{P)/{tt ■ B{P)) implies n > 2k. 
Proof. By l|3) we have 



n > 



A{P) 

T • (dn)'- 



-|y|. 



To get a bound on k, observe that at most l{e)/2d„ 
each edge e of P. This sums up to 



1 points are placed on 



B{P) 

2dn 



1 ^ 1 - 



Simple calculations now show that the condition on dn stated in the lemma 
implies n > 2k. 



The following is a main theorem of this paper. 

Theorem 2. Suppose n is large enough to assure the conditions dn < l(V,V)/2 
and dn < A{P) / {tt ■ B (P)) . Then no edge in the triangulation T~^ = DTfSfUV ) 
is longer than 6 • dn. Moreover, T+ exhibits an edge length ratio of 6. 

Proof. Two cases are distinguished, according to the value of w{p'^). 

Case 1: w{pn) < dn. Concerning upper bounds. Lemma 0 implies l{e) < 
2 ■ w{p'n) < 2 ■ dn for all edges e belonging to non-critical triangles of T+. If 
e belongs to some critical triangle. Lemma 0 shows that l{e) cannot be larger 
than the maximum edge length on the boundary of P, which is at most 3 • 
by construction. Concerning lower bounds. Lemma El gives l{e) > w{p'^) for 
edges of interior triangles. We know w{p'^) > d*/2 from Lemma 0 which implies 
l{e) > dn/2 because dn > dn. For edges spanned by V’ , we trivially obtain 
l{e) > dn as 1{V',V) > dn by construction. 
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Case 2: w{p'^) > The upper bound 2 ■w{p'^) for non-critical triangles now 
gives l{e) < due to Lemmas 0 and 0 The lower bound for interior triangles 
becomes /(e) > w{p'^) > c/„. The remaining two bounds are the same as in the 
former case. 

3.3 Computational issues. The time complexity of computing the trian- 
gulation is dominated by Steps 2 and 3 of Algorithm INSERT. In the very 
first iteration of the algorithm, both steps can be accomplished in 0(|U| log |U|) 
time. In each further iteration j we update the current Voronoi diagram under 
the insertion of a new point pj in Step 2, as well as a set of weights for the 
Voronoi vertices and relevant polygon boundary points in Step 3. 

Since we already know the location of the new point pj in the current Voronoi 
diagram, the region of pj can be integrated in time proportional to the degree 
of Pj in the corresponding Delaunay triangulation, deg(pj). We then need to 
calculate, insert, and delete 0(deg(pj)) weights, and then select the largest one 
in the next iteration. This gives a runtime of 0{deg{pj) ■ logn) per iteration. 

The following lemma bounds the number of constructed triangles, of a certain 
type. Let us call a triangle good if it is both interior and non-critical. 

Lemma 7. The insertion of each point pj creates only a constant number of 
good triangles. 

Proof. Consider the endpoints of all good triangles incident to pj in DT(5'j U 
V), and let X be the set of all such endpoints interior to P. Then 1{X,X) > 
l{Sj,Sj) > Wj-i{pj), due to 0. On the other hand, by Lemma El X lies in the 
circle of radius 2 • Wj-\{pj) around pj. As a consequence, |V| is constant. The 
number of good triangles incident to pj is at most 2 • |AT|, as one such triangle 
would have two endpoints on P’s boundary, otherwise. 

For most choices of P and n, the good triangle type will be most frequent. Note 
also that the degree of all points in the final triangulation T+ has to be constant. 

In conclusion, we obtain a runtime bound of O(n^logn) and a space com- 
plexity of 0(n). However, Lemma Q suggests a runtime of O(logn) in most 
iterations. 

4 Approximation Results 

Let us now return to the three optimization problems for the polygon P posed 
in the introduction. We will rely on Theorem E]in the following. Recall that, in 
order to make the theorem hold, we have to choose n sufficiently large. 

Theorem 3. The triangulation T~^ approximates the optimal solution for Prob- 
lem 1 by a factor of 6. 

Proof. Theorem El guarantees for T’*' an edge length ratio of 6, and for no trian- 
gulation this ratio can be smaller than 1. 
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We now turn our attention to Problem 2. Let the point set S in conjunction with 
the triangulation T of S' U P be the corresponding optimum solution. Let diong 
denote the optimum objective value, that is, diong measures the longest edge 
in T. The lemma below relates diong to the optimum value d* for the extreme 
packing problem for P. The proof is omitted in this extended abstract. 

Lemma 8. diong > 

We strongly conjecture that the statement of Lemma 0 can be strengthened 
to diong > d*, which would improve the bounds in Theorems 0 and 0 below. 

Theorem 4. The triangulation T~^ constitutes a A'Ji-approximation for Prob- 
lem 2. 

Proof. Let Cmax denote the longest edge in T+. By Theorem0we have l{emax) < 
6-d„. Trivially d„ < d* holds, and Lemma0implies the theorem, l{ema.y) / diong < 

4 \/ 5 . 

Finally let us consider Problem 3. Let dperi denote the optimum objective 
value for this problem. We show the following: 

Theorem 5. The triangulation T~^ gives a approximation for Problem 3. 

Proof. For any triangulation of P with n Steiner points, its longest edge cannot 
be shorter than ^ • d* by Lemma 0 This implies dperi > 's/S • d* by the triangle 
inequality. On the other hand, for the longest edge Cmax of T’*' we have l{emax) ^ 
6 • d* due to Theorem 0 The longest triangle perimeter 5max that occurs in T+ 
is at most 3 • l{emax)- In summary, 5max/dpe„ < 6 • VS. 

We conclude this section by mentioning an approximation result concerning 
minimum-weight triangulations. The easy proof is omitted. 

Theorem 6. Let S'^ be the vertex set of T~^ and let MWT(S~^) denote the 
minimum-weight triangulation of S~^ . Then T~^ is a 6-length approximation for 
MWT{S+). 

5 Experimental Results 

We have performed computational experiments in order to see the effectiveness 
of the proposed algorithm. For space limitations, we focus on Problem 1 and we 
only give detailed results for two typical convex polygons. The first polygon is 
rather fat while the second one is skinny and has a very long edge. The length 
of the shortest edge in both polygons is roughly equal to d„ for n = 50. We have 
tested the four cases of n = 50, 100, 200, 300. 

As we described in Section 3, Algorithm INSERT places points on the bound- 
ary in the first run, and then is restarted to produce the triangulation. Thereby, 
the consecutive distance between placed points need to be set to 2d„ ~ 3d„ in 
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points 
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edge ratio 
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2.08 
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2 ~ 


3 


3.23 


371 


392 


2.71 


335 


343 


300 


1 ~ 


2 


2.04 


567 


700 


2.1 


531 


657 




2 ~ 


3 


3.85 


567 


580 


2.5 


531 


563 



order to ensure the desired results theoretically. However, the case of ~ 2d„ is 
also tested in our experiments. The computational results are summarized below 
(fat polygon left, skinny polygon right). 

We observe that (1) the actual edge length ratio is much better than the 
worst-case ratio of 6 given by TheoremOl and (2) the number of flips per insertion 
of a point is very small (roughly two). Although the example polygons do not 
satisfy < l{V,V)/2 (as required by Theorem EJ we obtained much better 
ratios. This exhibits the practical effectiveness of the proposed algorithm. 

Note also that for a fat polygon (left) the choice of consecutive distance of 
dn ~ 2dn produces a better ratio, while for a skinny polygon (right) the contrary 
can be observed. 

6 Discussion and Extensions 

We have considered the problem of generating length-uniform triangular meshes 
for the interior of convex polygons. A unifying algorithm capable of computing 
constant approximations for several criteria has been developed. The basic idea 
has been to relate the length of triangulation edges to the optimum extreme pack- 
ing distance. The method is easy to implement and seems to produce acceptably 
good triangular meshes as far as computational experiments are concerned. 

In practical applications, more general input polygons need to be triangu- 
lated. We stress that our algorithm works with minor modification for arbitrary 
polygons with possible holes. Convexity is used solely in the proof of Lemma |HI 
As a consequence. Theorems Q and |3 the approximation result for Problem 1, 
and Theorem 0 still hold. The modification needed is that visible distances in 
a non-convex polygon P should be considered only, in the proofs as well as 
concerning the algorithm. That is, for the point sets S' C P in question, the 
Delaunay triangulation of S U U constrained by P has to be utilized rather than 
DT(SUU). 

The proof of Lemma (and with it the approximation results for Problems 2 
and 3) still go through for non-convex polygons P with interior angles of at 
most provided n is large enough to make the value -^diong fall short of 
the minimum distance between non-adjacent edges of P. We pose the question 
of establishing a version of Lemma 0 for general non-convex polygons, and of 
improving the respective bound ^ for the convex case. 
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Viewed from the point of applications to the design of structures, it is also 
important to generate a triangular mesh for approximating surfaces such as 
large-span structures. For this direction, our result concerning Problem 1 can be 
extended to spherical polygons. More precisely, given a convex polygon whose 
vertices lie on a hemisphere (or a smaller region of a sphere cut by a plane), the 
problem is to find a triangular mesh whose points are on the hemisphere that 
minimizes the objective function of Problem 1. It can be shown that the algo- 
rithm obtained by appropriately modifying Algorithm INSERT attains a 3-\/5 
approximation ratio. 

Acknowledgements. This research was partially supported by the Austrian Spezial- 
forschungsbereich F003, Optimierung und Kontrolle, and the Grant-in- Aid for Scientihc 
Research on Priority Areas (B) (No. 10205214) by the Ministry of Education, Science, 
Sports and Culture of Japan. In addition, the fifth anthor was also supported by NSF 
of China (No. 19731001) and the Japan Society for the Promotion of Science of Japan. 
We gratefnlly acknowledge all these supports. 



References 

1. F. Aurenhammer, “Voronoi diagrams - a survey of a fundamental geometric data 
structnre”, ACM Computing Surveys 23 (1991), 345-405. 

2. M. Bern, D. Dobkin and D. Eppstein, “Triangulating polygons without large an- 
gles”, Inti. J. Comput. Geom. and Appl. 5 (1995), 171-192. 

3. M. Bern and D. Eppstein, “Mesh generation and optimal triangnlation” , in D.- 
Z. Du (ed.), Computing in Euclidean Geometry, World Scientific Publishing, 1992, 
47-123. 

4. M. Bern, D. Eppstein and J.R. Gilbert, “Provably good mesh generation”, Journal 
of Computer and System Sciences 48 (1994), 384-409. 

5. M. Bern, S. Mitchell and J. Ruppert, “Linear-size nonobtuse triangulation of poly- 
gons”, Proceedings of the 10th Ann. ACM Symposium on Computational Geom- 
etry (1994), 221-230. 

6. P. Ghew, “Gnaranteed-Quality Mesh Generation for Curved Surfaces” , Proceedings 
of the 9th Ann. ACM Symposinm on Computational Geometry (1993), 274-280. 

7. H. Edelsbrnnner and T.S. Tan, “A quadratic time algorithm for the minmax length 
triangulation”, SIAM Journal on Computing 22 (1993), 527-551. 

8. T. Feder and D.H. Greene, “Optimal Algorithms for Approximate Clnstering”, 
Proceedings of the 20th Ann. ACM Symposium STOC (1988), 434-444. 

9. T. Gonzalez, “Clustering to minimize the maximnm interclnster distance”. Theo- 
retical Computer Science 38 (1985), 293-306. 

10. D.S. Johnson, “The NP-completeness column: An ongoing guide”. Journal of Al- 
gorithms 3 (1982), 182-195. 

11. E. Melisseratos and D. Souvaine, “Coping with inconsistencies: A new approach 
to produce quality triangulations of polygonal domains with holes”. Proceedings 
of the 8th Ann. ACM Symposium on Computational Geometry (1992), 202-211. 

12. H. Nooshin, K. Ishikawa, P.L. Disney and J.W. Butterworth, “The traviation pro- 
cess” , Journal of the International Association for Shell and Spatial Structures 38 
(1997), 165-175. 




Approximating Uniform Triangular Meshes in Polygons 



33 



13. M. Ohsaki, T. Nakamura and M. Kohiyama, “Shape optimization of a double-layer 
space truss described by a parametric surface”, International Journal of Space 
Structures 12 (1997), 109-119. 

14. K.F. Roth, “On a problem of Heilbronn”, Proc. London Mathematical Society 26 
(1951), 198-204. 

15. J. Ruppert, “A Delaunay Refinement Algorithm for Quality 2-Dimensional Mesh 
Generation”, Journal of Algorithms 18 (1995), 548-585. 




Maximum Induced Matchings 
of Random Cubic Graphs 



William Duckworth^, Nicholas C. Wormald^*, and Michele Zito^** 

^ Department of Mathematics and Statistics, University of Melbourne, Australia 
{billy , nick}@ms . unimelb . edu . au 
^ Department of Computer Science, University of Liverpool, UK 
micheleScsc . liv. ac .uk 



Abstract. In this paper we present a heuristic for finding a large in- 
duced matching At of cubic graphs. We analyse the performance of 
this heuristic, which is a random greedy algorithm, on random cubic 
graphs using differential equations and obtain a lower bound on the ex- 
pected size of the induced matching returned by the algorithm. The 
corresponding upper bound is derived by means of a direct expecta- 
tion argument. We prove that At asymptotically almost surely satisfies 
0.2704n < |At| < 0.2821n. 



1 Introduction 



An induced matching of a graph G = {V, E) is a set of vertex disjoint edges 
MCE with the additional constraint that no two edges of At are connected 
by an edge of E \ M. We are interested in finding induced matchings of large 
cardinality. 

Stockmeyer and Vazirani El introduced the problem of finding a maximum 
induced matching of a graph, motivating it as the “risk-free marriage problem” 
(find the maximum number of married couples such that each person is com- 
patible only with the person (s)he is married to). This in turn stimulated much 
interest in other areas of theoretical computer science and discrete mathematics 
as finding a maximum induced matching of a graph is a sub-task of finding a 
strong edge-colouring of a graph (a proper colouring of the edges such that no 
edge is incident with more than one edge of the same colour as each other, see 
(for example) [tblblhllDj l. 

The problem of deciding whether for a given integer k a given graph G 
has an induced matching of size at least k is NP-Complete El, even for bi- 
partite graphs of maximum degree 4. It has been shown FUTTlj that the prob- 
lem is APX-complete even when restricted to fcs-regular graphs for k € {3,4} 
and any integer s > 1. The problem of finding a maximum induced matching 
is polynomial-time solvable for chordal graphs |2| and circular arc graphs [Z|. 
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Recently Golumbic and Lewenstein [Sj have constructed polynomial-time algo- 
rithms for maximum induced matching in trapezoid graphs, interval-dimension 
graphs and co-comparability graphs, and have given a linear-time algorithm for 
maximum induced matching in interval graphs. 

In this paper we present a heuristic for finding a large induced matching 
A4 of cubic graphs. We analyse the performance of this heuristic, which is a 
random greedy algorithm, on random cubic graphs using differential equations 
and obtain a lower bound on the expected size of the induced matching returned 
by the algorithm. The corresponding upper bound is derived by means of a 
direct expectation argument. We prove that A4 asymptotically almost surely 
satisfies 0.2704n < |A4| < 0.2821n. Little is known on the complexity of this 
problem under the additional assumption that the input graphs occur with a 
given probability distribution. Zito ra presented some simple results on dense 
random graphs. The algorithm we present was analysed deterministically in Pj 
where it was shown that, given an n-vertex cubic graph, the algorithm returns 
an induced matching of size at least 3n/20-|-o(l), and there exist infinitely many 
cubic graphs realising this bound. 

Throughout this paper we use the notation P (probability), E (expectation), 
u.a.r. (uniformly at random) and a.a.s. (asymptotically almost surely). When 
discussing any cubic graph on n vertices, we assume n to be even to avoid parity 
problems. 

In the following section we introduce the model used for generating cubic 
graphs u.a.r. and in Sectional we describe the notion of analysing the performance 
of algorithms on random graphs using a system of differential equations. Section 0 
gives the randomised algorithm and Section Ogives its analysis showing the a. a. 
sure lower bound. In Section Owe give a direct expectation argument showing 
the a. a. sure upper bound. 

2 Uniform Generation of Random Cubic Graphs 

The model used to generate a cubic graph u.a.r. (see for example Bollobas P) 
can be summarised as follows. For an n vertex graph 

— take 3n points in n buckets labelled 1 . . . n with three points in each bucket 

and 

— choose u.a.r. a perfect matching of the 3n points. 

If no edge of the matching contains two points from the same bucket and no two 
edges contain four points from just two buckets then this represents a cubic graph 
on n vertices with no loops and no multiple edges. With probability bounded 
below by a positive constant, loops and parallel edges do not occur p. The 
edges of the random cubic graph generated are represented by the edges of the 
matching and its vertices are represented by the buckets. 

We may consider the generation process as follows. Initially, all vertices have 
degree 0. Throughout the execution of the generation process, vertices will in- 
crease in degree until the generation is complete and all vertices have degree 3. 
We refer to the graph being generated by this process as the evolving graph. 
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3 Analysing Algorithms Using Differential Equations 

We incorporate the algorithm as part of a matching process generating a random 
cubic graph. During the generation of a random cubic graph, we choose the 
matching edges sequentially. The first end-point of a matching edge may be 
chosen by any rule, but in order to ensure that the cubic graph is generated 
u.a.r., the second end-point of that edge must be selected u.a.r. from all the 
remaining free points. The freedom of choice of the first end-point of a matching 
edge enables us to select it u.a.r. from the vertices of given degree in the evolving 
graph. 

The algorithm we use to generate an induced matching of cubic graphs is a 
greedy algorithm based on choosing a vertex of a given degree and performing 
some edge and vertex deletions. In order to analyse our algorithm using a system 
of differential equations, we generate the random graph in the order that the 
edges are examined by the algorithm. 

At some stage of the algorithm, a vertex is chosen of given degree based upon 
the number of free points remaining in each bucket. The remaining edges incident 
with this vertex are then exposed by randomly selecting a mate for each spare 
point in the bucket. This allows us to determine the degrees of the neighbours 
of the chosen vertex and we refer to this as probing those vertices. This is done 
without exposing the other edges incident with these probed vertices. Once the 
degrees of the neighbours of the chosen vertex are known, an edge is selected to 
be added to the induced matching. Further edges are then exposed in order to 
ensure the matching is induced. More detail is given in the following section. 

In what follows, we denote the set of vertices of degree i of the evolving 
graph by Vi and let Yi{= U(t)) denote |E| (at time t). The number of edges 
in the induced matching at any stage of the algorithm (time t) is denoted by 
M(= M{t)) and we let s(= s{t)) denote the number of free points available in 
buckets at any stage of the algorithm (time t). Note that s = X)i=o(^ ~ 

E(Z\X) denote the expected change in a random variable X conditional upon 
the history of the process. 

One method of analysing the performance of a randomised algorithm is to 
use a system of differential equations to express the expected changes in vari- 
ables describing the state of an algorithm during its execution (see ^2] for an 
exposition of this method). 

We can express the state of the evolving graph at any point during the 
execution of the algorithm by considering Yq, Yi and U- In order to analyse 
our randomised algorithm for finding an induced matching of cubic graphs, we 
calculate the expected change in this state over one time step in relation to 
the expected change in the size M of the induced matching. We then regard 
E(Z\li)/E(Z\M) as the derivative dU/dM, which gives a system of differential 
equations. The solution to these equations describes functions which represent 
the behaviour of the variables U. There is a general result which guarantees 
that the solution of the differential equations almost surely approximates the 
variables Yi. The expected size of the induced matching may be deduced from 
these results. 
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4 The Algorithm 

In order to find an induced matching of a cubic graph, we use the following 
algorithm. We assume the generated graph G to be connected (since random 
cubic graphs are connected a.a.s.). Then, at any stage of the algorithm after the 
first step and before its completion, W + I2 > 0 . The degree of a vertex v in 
the evolving graph is denoted by deg{v). We denote the initial set of 3 n points 
by P. B{p) denotes the bucket that a given point p belongs to and we use q{b) 
to denote the set of free points in bucket b. The set of induced matching edges 
returned is denoted by Ai. Here is the algorithm; a description is given below. 

select (pi, P); 

expose(pi,p2); 

isolate(P(pi),P(p2)); 

M ^ {B{pi),B{p2))-, 
while {Y\ + I2 > 0 ) 

{ if {Y2 > 0 ) 

{ select(u, V2); {pi} ^ q{u)] 
expose(pi,p2); u^P(p2);} 
else 

{ select(u, Vi); {pi,p2\ ^ g(u); 
expose(pi,p3); a ^ B{ps); 
expose(p 2 ,P 4 ); b ^ B{p 4 ); 
if {deg{a) > deg{b)) u <— a; 

else if (deg{b) > deg{a)) u <— 6 ; 
else select(u, {a, 6 });} 

isolate(M, u); 

M. ^ M. U (m, u); 

} 



The function select(s, S) involves the process of selecting the element s u.a.r. 
from the set S. The function expose(pj,pj) involves the process of deleting 
the selected point pi from P, exposing the edge {B{pi), B{pj)) by randomly 
selecting the point pj from P and deleting the point pj from P. The function 
isolate(i?i, P2) involves the process of randomly selecting a mate for each free 
point in the buckets Bi and B2 and then exposing all edges incident with these 
selected mates. This ensures that the matching is induced. 

The first step of the algorithm involves randomly selecting the first edge of the 
induced matching and exposing the appropriate edges. We split the remainder 
of the algorithm into two distinct phases. We informally define Phase 1 as the 
period of time where any vertices in V2 that are created are used up almost 
immediately and I2 remains small. Once the rate of generating vertices in V2 
becomes larger than the rate that they are used up, the algorithm moves into 
Phase 2 and the majority of operations involve selecting a vertex from V2 and 
including its incident edge in the induced matching. 
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There are two basic operations performed by the algorithm. Type 1 refers to 
a step when I 2 > 0 and a vertex is chosen from V 2 - Similarly, Type 2 refers to 
a step where I 2 = 0 and a vertex is chosen from Vi . 

Figs. Q and I2 show the configurations that may be encountered performing 
operations of Type 1 and Type 2 respectively (a.a.s.). For Type 1, we add the 
edge incident with the chosen vertex. For Type 2, if (after exposing the edges 
incident with the chosen vertex u) exactly one of the neighbours of u has exactly 
two free points, we add the edge incident with this vertex and the chosen vertex. 
Otherwise we randomly choose an edge incident with the chosen vertex. 




Fig. 1. Selecting a vertex from V 2 and adding its incident edge to A4. 




2a 2b 2c 



Fig. 2. Selecting a vertex from V\ and choosing an edge to add to M. 



The larger circles represent buckets each containing 3 smaller circles repre- 
senting the points of that bucket. Smaller circles coloured black represent points 
that are free. Used points are represented by white circles. Points which are not 
known to be free at this stage of the algorithm are shaded. 

In all cases, the selected vertex is labelled u and the other end-point of the 
induced matching edge chosen is labelled v. A vertex labelled v* denotes that a 
random choice has been made between 2 vertices and this one was not selected. 
After selecting a vertex u of given degree, the edges incident with this vertex are 
exposed. Once we probe the degrees of the neighbours of this vertex, we then 
make the choice as to which edge to add to the induced matching. Only then 
are other edges exposed. Therefore, at this stage, we do not know the degrees 
of all the vertices at distance at most two from the end-points of the selected 
induced matching edge. A vertex whose degree is unknown is labelled either w 
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or p. A vertex labelled p will have one of its incident edges exposed and will 
subsequently have its degree increased by one. We refer to these vertices as incs. 
A vertex labelled w will have all of its incident edges exposed and we refer to 
these vertices as rems. Should any rem be incident with other vertices of unknown 
degree then these vertices will be incs. 

Once the choice of induced matching edge has been made, all edges incident 
with the endpoints of the induced matching edge are exposed and subsequently 
the edges of the neighbours of the end-points of the chosen edge are exposed. 
This ensures the matching is induced. 

5 The Lower Bound 

Theorem 1. For a cubic graph on n vertices the size of a maximum induced 
matching is asymptotically almost surely greater than 0.2704n 

Proof. We define a clutch to be a series of operations in Phase i involving the 
selection a vertex from Vi and all subsequent operations up to but not including 
the next selection of a vertex from Vi . Increment time by 1 step for each clutch of 
vertices processed. We calculate Ei{AYi) and Ei{AM) for a clutch in each Phase. 

5.1 Preliminary Equations for Phase 1 

The initial step of Phase 1 is of Type 2 (at least a.a.s.). We consider operations 
of Type 1 first and then combine the equations given by these operations with 
those given by the operations of Type 2. 

Operations of Type 1 involve the selection of a vertex u from V 2 (which 
has been created from processing a vertex from Vi). The expected change in 
I 2 is negligible and can assumed to be 0 since this is phase 1 and I 2 remains 
small as vertices of V 2 are used up almost as fast as they are created. The 
desired equation may be formulated by considering the contribution to E{AYi) 
in three parts; deleting v (the neighbour of u), deleting the neighbour(s) of v 
and increasing the degree of the other neighbours of these neighbours by one. 

Let Pi denote the contribution to E{AYi) due to changing the degree of an 
inc from i to i + 1 and we have 

(z - 3)Yi + (4 - i)Yi_i 

Pi = ■ 

s 

This equation is valid under the assumption that F_i = 0. 

We let p,i = pi{f) denote the contribution to E{AYi) due to a rem and all its 
edges incident with incs and we have 

(z-3),^ , { 6 Y 0 + 2 Y,) 

Pi = L -I Pj. 

s s 

Let ai = ai{t) denote the contribution to E{AYi) for an operation of Type 
1 in Phase 1. We have 

(»-3) ^, + ( ^yo + 2 Yi 




CXi — 



s 



s 
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We now consider operations of Type 2. Let (ih,i{= Ph,i{t)) denote the contri- 
bution to E(Z\li) for operation 2h given in Fig.|2l (at time t). We will also use 
Sij = 1 ii i = j and Sij = 0 otherwise. 

We have 

/5a,i — 4“ /tj 4“ Pi, 

Pb^i — ^iO 4“ Pi 4“ 2pi, 

Pc,i = ~2(5io ~ 4- 2pi -|- 2pi. 

These equations are formulated by considering the contribution to E(Z\Fi) by 
deletion of vertices of known degree and the contribution given by the expected 
degree of vertices of unknown degree. 

For an operation of Type 2 in Phase 1, neighbours of u (the vertex selected 
at random from Vi) are in {Vb U Vi}, since >2 = 0 when the algorithm performs 
this type of operation. The probability that these neighbours are in Vq or Vi 
are 31b /s and 2Yi/s respectively. Therefore the probabilities that, given we are 
performing an operation of Type 2 in Phase 1, the operation is of type 2a, 2b or 
2c are given by P(2a) = P(26) = and P(2c) = respectively. 

We define a birth to be the generation of a vertex in V 2 by processing a vertex 
of W. Let vi{= be the expected number of births from processing a vertex 

from Vi (at time t). Then we have 

P2 + + P(26) (^P2 + ^) + P(2c) (^2p2 + ^ 




Here, for each case, we consider the probability that vertices of degree one (in 
the evolving graph) become vertices of degree two by exposing an edge incident 
with the vertex. 

Similarly, we let 1^2 = ^ 2 {t) be the expected number of births from processing 
a vertex from V 2 ■ Then we have 



V2 



1 ^ 61-0 + 21 - 1 ^ 



M2, 



giving the expected number of births in a clutch to be J^i/(1 — V 2 )- 

For Phase 1, the equation giving the expected change in lb for a clutch is 
therefore given by 

E(Zilb) = P(2a)/4a,* + P(26)/3b,, + P(2c)/3,,, + -^a,. 

I - V2 

The equation giving the expected increase in M for a clutch is given by 

E(Z\M) = 1 + 

\ — V2 

since the contribution to the increase in the size of the induced matching by the 
Type 2 operation in a clutch is 1. 
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5.2 Preliminary Equations for Phase 2 

In Phase 2, all operations are considered to be of Type 1 and therefore a clutch 
consists of one step, but we must also consider the expected change in Y 2 since 
this is no longer a negligible amount. The expected change in Yi is given by 

E(Z\r,) = a, - (5,2 

where remains the same as that given for Phase 1 and the expected increase 
in M is 1 per step. 



5.3 The Differential Equations 

The equation representing E(Z\li) for processing a clutch of vertices in Phase 
1 forms the basis of a differential equation. Write Yi{t) = nzi{tln), = 

nTi{t/n), Pj,i(t) = s(t) = n^(t/n), ai(t) = nxi{t/n) and Vj(t) = 

nujj{tln). The differential equation suggested is 



/ ^^1 / 



12zqZi 



I , 

V’b.i “I 'prVc,i + 

C 



1 — W2 



Xi 



(*€{0,1}) 



where differentiation is with respect to x and xn represents the number of 
clutches. From the definitions of /i, j3, s, a and v we have 



_ _ (i-3) _ I (6zo+2zi)((i-3)zi + (4-i)zi_i) 

•i — ^ ■ 2 'z “r ^2 5 



V;,,, = -3(5,1 + R + 



V'b.i 


0 

1 

II 


-2,5, 


il+T, + 2^- 


3)zi + (4- 


5 




II 

1 

to 

0 

1 


ii H — l“2Ti + 2 


(i— 3)zi + (4— i)z,_i 


? 


= 


3 ( 3 - 


i)zi, 






X* 


_ b-3) 


Zi + 


6zo+2zi _ 

^ •l') 






Wi 




T2 + 


2zi \ 1 122021 


■ (t2 + 


4zi \ , 4zJ 


W2 


_ 6zo+2zi _ 









Using the equation representing the expected increase in the size of M after 
processing a clutch of vertices in Phase 1 and writing M{t) = nz(tjn) suggests 
the differential equation for z as 



U = 1 + 



^1 

1 — W2 
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We compute the ratio ^ = 



z'(x) 



and we have 



c2 Ya 






9^n 



1 + 



1^1 

1— o;2 



(*e{o,i}) 



where differentiation is with respect to z and all functions can be taken as 
functions of z. 

For Phase 2 the equation representing E(Z\li) for processing a clutch of 
vertices suggests the differential equation 



z'i = Xi-^i2 (0<i<2). 



( 1 ) 



The increase in the size of the induced matching per clutch of vertices processed 
in this Phase is 1, so computing the ratio ^ gives the same equation 

as that given in ( 0 - Again, differentiation is with respect to z and all functions 
can be taken as functions of z. 

The solution to this system of differential equations represents the cardinal- 
ities of the sets Vi for given M (scaled by ^). For Phase 1, the initial conditions 
are 

zo(0) = 1, z,(0) = 0 (* > 0). 

The initial conditions for Phase 2 are given by the final conditions for Phase 1. 

Wormald H2! describes a general result which ensures that the solutions to 
the differential equations almost surely approximate the variables Yi . It is simple 
to define a domain for the variables Zi so that Theorem 5.1 from tI2[ may be 
applied to the process within each phase. An argument similar to that given 
for independent sets in H2I or that given for independent dominating sets in gj 
ensures that a.a.s. the process passes through phases as defined informally, and 
that Phase 2 follows Phase 1. Formally, Phase 1 ends at the time corresponding 
to W 2 = 1 as defined by the equations for Phase 1. Once in Phase 2, vertices in 
V 2 are replenished with high probability which keeps the process in Phase 2. 

The differential equations were solved using a Runge-Kutta method, giving 
W 2 = 1 at z = 0.1349 and in Phase 2 Z 2 = 0 at z > 0.2704. This corresponds 
to the size of the induced matching (scaled by when all vertices are used up, 
thus proving the theorem. □ 



6 The Upper Bound 

Theorem 2. For a cubic graph on n vertices the size of a maximum induced 
matching is asymptotically almost surely less than 0.2821n 

Proof. Consider a random cubic graph G on n vertices. Let M(G,k) denote 
the number of induced matchings of G of size k. We calculate E(M(G, k)) and 
show that when k > 0.2821n, E(A/(G, k)) = o(l) thus proving the theorem. Let 
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E(M(G,fc)) < 



N{k) 3' 



2fc 



(3(n-2fc))! N{^-5k) 

N{^) 



(3n- lOfc)! 
n! 3^^ {3{n-2k)y. (f^)! 

( 3n 



k\ {n-2k)\ {f-5ky. (3n)! 

Approximate using Stirling’s formula and re-write using f{x) = k = k/n 
and we have 



E(M(G, k))-^ 



3^^^ /(3(1 - 2^)) /(|) 2^'^ 
/(k) /(I - 2k) /(§ - 5k) /(3) ■ 



Solving this we find that for k > 0.2821 the expression on the right tends to 0. 

□ 



Note that this bound may be improved by counting only maximal matchings. 
However the improvement is slight and we do not include the details here for 
reasons of brevity. 
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Abstract. Arrangements of lines in the plane and algorithms for com- 
puting extreme features of arrangements are a major topic in compu- 
tational geometry. Theoretical bounds on the size of these features are 
also of great interest. Heilbronn’s triangle problem is one of the famous 
problems in discrete geometry. In this paper we show a duality between 
extreme (small) face problems in line arrangements (bounded in the unit 
square) and Heilbronn-type problems. We obtain lower and upper com- 
binatorial bounds (some are tight) for some of these problems. 



1 Introduction 

The investigation of arrangements of lines in the plane has attracted much atten- 
tion in the literature. In particular, certain extremal features of such arrange- 
ments are of great interest. Using standard duality between lines and points, 
such arrangements can be mapped into sets of points in the plane, which have 
also been studied intensively. In this dual setting, distributions in which certain 
features (defined by triples of points) assume their maxima are often sought. 
In this paper we show a connection between these two classes of problems and 
summarize the known bounds for some extremal-feature problems. 

Let A{C) be an arrangement of a set £ of n lines. We assume the lines of 
A to be in general position, in the sense that no two lines have the same slope. 
Thus every triple of lines define a triangle. Let U = [0,1]^ be the unit square. 
An arrangement A is called narrow if all its lines intersect the two vertical sides 
of U . A narrow arrangement A is called transposed if the lines of A intersect the 
vertical sides of U in two sequences that, when sorted in increasing y order, are 
the reverse of each other. Clearly, every transposed arrangement is also narrow. 
Note that all the vertices of a transposed arrangement lie in U. Later in the paper 
we will also define the set of convex arrangements which is a proper subset of 
the set of transposed arrangements. 

In this paper we investigate the “size” of triangles defined by the lines of 
narrow arrangements, according to several measures of the size of a triangle. We 
consider the arrangements in which the minimum size of a triangle assumes its 
maximum, and attempt to bound this value. 
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Heilbronn’s triangle problem is the following: 



Let {-Pi, P 2 , ■ • ■ , Pri} be a set of n points in U, such that the minimum 
of the areas of the triangles PiPjPk (for l<i<j<k<n) assumes its 
maximum possible value Qo(n). Estimate Go{n). 



Heilbronn conjectured that Go{n) = 0(;^). The first nontrivial upper bound 
(better than O (;{■)), namely, 0( — , ^ ), was given by Roth H. Schmidt El 

improved this result twenty years later and obtained Go{n) = 0{ — / ). Then 

n-^logn 

Roth improved the upper bound twice to O( „i.io 5 ... ) 13 and 0( „i.A7... ) |H1- The 
best currently-known upper bound, Go{n) = 0{ 8/r-s ) = 0( i.Aa... ) (for any 



e > 0), is due to Komlos, Pintz, and Szemeredi P] by a further refinement of the 
method of EE. A simple probabilistic argument by Alon et al. 0 p. 30] proves 
a lower bound of 17(^). Erdos P, appendix] showed the same lower bound by an 
example. However, Komlos, Pintz, and Szemeredi P show by a rather involved 
probabilistic construction that Go{n) = In a companion paper 0 we 

show a lower bound for the generalization of Heilbronn’s triangle problem to 
higher dimensions. 

In this paper we consider some variants of Heilbronn’s problem, in which 
other measures of triangles defined by triples of points are considered, and/or 
some restrictions are imposed on the locations of the points in U . Specifically, 
in a monotone decreasing distribution of points, the x-coordinates and the y- 
coordinates of the points appear in opposite permutations, and a convex mono- 
tone decreasing distribution is a monotone decreasing distribution in which the 
points form a convex (or concave) chain according to the respective permutation. 



2 The Duality and the Problems 

Let A be a narrow arrangement of n lines. We define two measures of triangles. 
Let Fi{t) be the vertical height of a triangle r, that is, the maximum length 
of a vertical segment contained in r. Let F 2 {t) be the area of r. Our goal is 
to bound Fi(n) = max^ min,-gyi -F’i(r) and F 2 {n) = max _4 min,-gyi -F 2 (t), where 
the maximum is taken over all narrow arrangements A of n lines, and where the 
minimum is taken over all triangles r formed by triples of lines in A. We use the 
superscripts “(mon)” and “(conv)” for denoting the monotone decreasing and 
the convex monotone decreasing variants, respectively. 

Denote by li (resp., ri) the y-coordinate of the intersection point of the line 
£i G C with the left (resp., right) vertical side of U. We dualize the line it 
to the point Pi = {li,ri). This dualization maps the measures F\ and F 2 of 
lines into measures of triples of points in U . The two optimization problems are 
mapped into generalizations of Heilbronn’s problem, where the difference is in the 
definition of the measure of a triple of points. Note that the dual of a transposed 
arrangement of lines is a monotone decreasing set of points. The measures G\ 
and G 2 (defined below) crucially rely on the decreasing monotonicity of the 
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points. A “convex” arrangement is an arrangement whose dual set of points lies 
in convex position. 

It is easy to see that the vertical height of a triangle is the minimum length 
of a vertical segment that connects a vertex of the triangle to the line supporting 
the opposite edge. This follows from the convexity of a triangle, and indeed that 
segment always lies inside the triangle. We now specify (in terms of the dual rep- 
resentation) the vertical distance between the intersection of two lines to a third 
line of A (in the primal representation). Refer to Figure The equation of £i is 



y = {ri — li)x + li. We compute the distance between Qi^k = {xi,k, Vi,k), the inter- 
section point of £i and £k, and Qi^k\j = {xi^k,yi,k\j)j the vertical projection of Qi^k 
on £j. A simple calculation shows that 



1 




0 



1 



Fig. 1. Vertical distance 



By substituting Xi^k in the equation of £j we find that yi^k\j 
Finally, 



_ rj{lk-h)-lj{rk-rj) 



J£)ist{Qi^ki Qi^k\j) \Vi,k I 



Xi{lk ^j) Xj(^lk li) Xk{^j li) 



{h - k) + {Xi - Tk) 




The numerator of the last term is the area of the triangle defined by the points 
Pi, Pj, and Pk- In case Pi and Pk are in monotone decreasing position, the 
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denominator is the perimeter of the axis-aligned box defined by Pi and Pk- 
When setting Gi{Pi,Pj,Pk) = Dist(Qi_fc, maximizing the smallest value 
of Fi over the triangles defined by triples of lines of A dualizes to maximizing 
the smallest value of Gi over triples of points in U . We denote the asymptotic 
value by ^i(n) = Pi{n). 

We now compute (in the dual representation) the area of the triangle defined 
by three lines of A (in the primal representation). Recall that 



Qi,j 



(XiJ, Uij) 




"J "3' I “I'' 3 \ 

k) - {rj - n) ’ {Ij - k) - {rj - n) 



Tedious computation shows that 



Area((^i ^ abs 



Ui-ij 1 
^j,k Vj,k 1 
Xk,i Vk,i 1 



— 2 Vj,kXk,i Xi^jUk^i T yi^jXk^i T Xi^j]Jj^k Vi^jXj^k') 



= - abs I 



= 16 abs I 



{krj - IjTi + Ikn - hvk + IjTu - IkTjY 



{{Ij - k) + {n - rj)){{lk - k) + {n - rk)){{lk - Ij) + {rj - Vk)) 



h D 1 
h rj 1 
h Tk 1 



2{{lj-k) + {ri-Tj)) ■ 2{{lk-k) + {ri-Tk)) ■ 2{{lk~lj) + {rj-Vk)) 






The numerator of the last term is the square of the area of the triangle defined by 
the points Pi, Pj, and Pk- In case Pi, Pj, and Pk are in monotone decreasing po- 
sition, the denominator is the product of the perimeters of the axis-aligned boxes 
defined by Pi, Pj, and Pk- When setting G 2 {Pi,Pj,Pk) = Are&{Qij,Qj^k,Qk,i), 
maximizing the smallest value of F 2 over the triangles defined by triples of lines 
of A dualizes to maximizing the smallest value of G 2 over triples of points in U . 
We denote the asymptotic value by G 2 {n) = p 2 {n)- 

In summary, we define a duality between narrow arrangements of lines and 
distributions of points in the unit square. We define measures of triangles defined 
by triples of lines of the arrangements, and find their dual measures of triples 
of points. We consider some special structures of the arrangements and their 
dual distributions of points. In all cases we look for the arrangement of lines (or 
distribution of points) that maximizes the minimum value of the measure over 
all triples of lines (or points), and attempt to asymptotically bound this value. 



3 Heilbronn’s Triangle Problem 

As noted in the introduction, the best known lower bound for the original Heil- 
bronn’s triangle problem, Qo{n) = J7(^^^), is obtained by Komlos, Pintz, and 
Szemeredi The same authors obtain in Pj the best known upper bound, 
0(;pTtW— )■ 
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Unlike the measures G\ and G 2 , it is not essential to require the set to be 
decreasing in the monotone case. Here we have only an upper bound: 

Theorem 1. 

Proof. Refer to the sequence of segments that connect pairs of consecutive 
points. The total length of the sequence is at most 2. Cover the whole sequence 
of segments by at most | squares with side ^ . Thus at least one square contains 
three points. The area of the triangle that they define is at most 

For the convex case the bound is tight: 

Theorem 2. = 6>(;^). 

Proof. The lower bound is obtained by a simple example: Put n points equally 
spaced on an arc of radius 1 and of length | (see Figure 0. The points are 



1 






Fig. 2. Uniform distribution of points on an arc 



ith point 

■ -? 




ordered from bottom-right to top-left. The coordinates of the ith point are 
(cos(2zi5), sin(2i(5)), for 1 < z < n, where S = 'KjifPn'). The area of the trian- 
gle defined by every three consecutive points is 4cos(<5) sin^(i5) = 0{:^). 

It is also easy to prove the upper bound. Refer to the sequence of segments 
that connect pairs of consecutive points. Drop all the segments of length greater 
than 8/n. Since the maximum length of such a convex and monotone chain is 2, 
less than n/4 segments are dropped. Now drop all pairs of consecutive segments 
whose external angle is greater than Air/n. Since the maximum turning angle of 
such a convex chain is tt/2, less than n/8 pairs are dropped. That is, less than n/4 
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segments are now dropped. In total we have dropped less than n/2 segments; 
therefore two consecutive segments have not been dropped. The area of the 
triangle which these two segments define is upper bounded by |(f)^sin(^) = 



4 Vertical Height 



Theorem 3. Q\{n) = 0{^). 

Proof. The lower bound is shown by an example 0 Set ^ - V and n = 
for 1 < i < n (a set of almost-parallel lines). In this example 



Gi{P,,Pj,Pk) 



{j — — {k — -I- (fc — 

ri3+'^{vf — n}) 



which is minimized when i, j, and k are consecutive integers. It is easy to verify 
that in this example, for every i, Gi{Pi, Pi+i, Pi+ 2 ) = n{n+i) ■ Hence Gi{n) = 

For the upper bound we return to the primal representation of the problem 
by an arrangement Aofn lines. Since A is narrow, there exists a vertical segment 
of length 1 which stabs all the n lines. Hence there exists a triple of lines which 
are stabbed by a vertical segment of length at most . Such a triple cannot 
define a triangle whose vertical height exceeds therefore !Fi{n) = 0{^). 

We now refer to transposed arrangements of lines (the dual of monotone 
decreasing Heilbronn’s sets of points). The best upper bound of which we are 
aware is Mitchell’s |5|. Here is a simplified version of the proof of this bound. 
Let 0i denote the slope of £i (for 1 < i < n). Assume without loss of generality 
that at least n/2 lines of C have positive slopes. For the asymptotic analysis we 
may assume that all the lines of C are ascending. Denote by h the minimum 
vertical height of a triangle in A{C). Then each pair of lines £i,£j G induces 
a hexagon of area ^ Figure 0 through which no other line of 

C can pass. This is since such a line would form with £i and tj a triangle whose 
vertical height is at most h/2. The intersection of every pair of such hexagons is 
empty, for otherwise there would be a triangle whose vertical height is less than 
h. Denote by S the total area of the ( 2 ) hexagons. On one hand. 



S = 



E 

l<2<j><n 



ih? cos{9i)cos{9j) 

4sin(0,— 0i) ~ 8 ^ 



1 



> 



?>hf 



sm{9j — 9i) 8 



l<2<j<n 



1 



where we use the facts that 0 < 0^ < tt/ 4 for 1 < i < n and sin(0) < 0. 



^ Note that this construction is monotone and even convex, and yet it beats the O(^) 
upper bound of C/J'’°^'"^(n). This is because it is an increasing construction. 
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Fig. 3. Hexagonal forbidden zones induced by intersection of lines 



Lemma 1. J2i<i<j<n 5 “^ = ^{n'^logn). 

Proof. Omitted in this short version of the paper. 

We thus have S = Q{hf"nf logn). On the other hand, the total area of all the 
hexagons along one line of C cannot exceed h (the area of a strip of vertical height 
h clipped to the unit square). By summing up for all the lines of C we obtain 
2S < hn. The combination of the two inequalities implies that h = 0(— — ). 
(Note that the “dropped-segments” argument gives in this case a weaker upper 
bound of 0(i).) Thus we have the following: 



Theorem 4. = 0(;n^)- 

Finally we consider convex arrangements (the dual of convex monotone de- 
creasing Heilbronn’s sets). As with the convex case of the original Heilbronn’s 
triangle problem, the bound in this case is tight: 



Theorem 5. C/|“"’''’(n) = 0 {^). 

Proof. For the lower bound we use the same example (points on an arc) as in the 
proof of Theorem El Clearly, the perimeter of the bounding box of every triangle 
defined by three consecutive points in this example is 0(-). Since the area of 
each such triangle is 0(^) we obtain the lower bound. For the upper bound we 
follow again the argument of dropping segments. The perimeter of the bounding 
box of the triangle defined by the two remaining consecutive segments is linear 
in the length s of the longer segment. The quotient of the area of the triangle 
and s is upper bounded by i(^)sin(^) = 6>(;^). 



Table Q] shows a few examples which show the lower bound = 



A Duality between Small-Face Problems 



51 



h 


Ti 


Gi{P,Pj,PA 


i 




1 1 


n 


1 n(i+k+n) 1 


i 


1 ^ 




n 


V n 


1 {y/l+Vk+y/n)y/n ' 


sin(^) 

k n 7 


COs(i^) 

' 71 ' 


1 )-Hsin( — ) i 


45- ) iv ) + sin( TV ) 



Table 1. Convex line arrangements which show that G 



(conv) 

1 



(n) 






5 Area 



The function G 2 {n) is as high as we like: 



Theorem 6. G 2 {n) = l7(/(n)) for any f{n). 

Proof. We use the construction = ^ — and ri = ^ (for 1 < i < n and for 
an arbitrarily large M). In this example 

[O' ~ — {k — + {k — 

which is minimized by i = 1, j = 2, and fc = 3. In this example G 2 {Pi, P 2 , P 3 ) = 
" ^ 2 n‘^+i ^ ~ * 2 logra^ + claim follows. 

We now refer to transposed arrangements of lines (the dual of monotone 
decreasing Heilbronn’s sets of points) . 

Theorem 7. G^°"'\ri) = 0{ — i ). 

Proof. We follow an argument similar to that used in the proof of Theorem El 
Let again 9i denote the slope of £i (for 1 < i < n), and assume without loss of 
generality that all the lines of C are ascending. Denote by A the minimum area 
of a triangle in A{C). Here the “forbidden zone” induced by each pair of lines 
G £ is a rectangle whose diagonal is of length (refer to Figure^. 

Indeed, if any other line of C passed through this rectangle, then together with 
£i and £j it would form a triangle whose area is less then A/2. The intersection 
of every pair of such rectangles is empty, for otherwise there would be a triangle 
whose area is less than A. The area of the forbidden rectangle is Let 

again S denote the total area of the ( 2 ) forbidden zones. Here we have: 



G2{P^,PJ,Pk) = 



S = 



E 



2A^ 

sin{9j — 9i) 



G{A^n^ logn), 



by using again Lemma ^ Since S' < 1, we obtain A = 0( 



n 



1 



^/iogn 



) 
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Fig. 4. Forbidden zones for ^ 2 ™™^ 



Finally we consider convex arrangements (the dual of convex monotone de- 
creasing Heilbronn’s sets). 

Theorem 8. 

Proof. For the lower bound we use the construction h = ^ and = 1 — (^)^ 
(for 1 < i < n). A simple calculation shows that in this example 

^(conv)/p p p \ _ (j ~ 0(fc ~ 0(fc ~ j) 

^ ^ 2(z -I- j -I- n)(i -I- /c -I- n)(j -I- A: -I- n) 

whose minimum is easily verified to be 0(^). 

For the upper bound we follow again the argument of dropping segments. 
Then G 2 {n) is upper bounded by fxy^, where x and y are the lengths of the 
two remaining consecutive segments, and A is the area of the triangle spanned 
by these two segments. But A^jxy'^ < j(|)sin^(^) = 0{^). 

6 Summary 

In this paper we show a relation between small-face problems in arrangements 
of lines and Heilbronn-type problems in point sets. We use a duality between the 
two classes for obtaining bounds for these problems. We summarize in Table 0 
the best bounds that we are aware of. (There is a slight abuse of notation in the 
synonyms of Gi and ^ 2 -) Note that every lower (resp., upper) bound trivially 
applies for columns on the left (resp., right) of it. We mention such trivial bounds 
in square brackets. The main open problem is to obtain tight bounds for some 
of the problems. 
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Measure 




Construction 




Arrangement 


Point 


Narrow (Arng.) 


Transp. (Arng.) 


“Convex” (Arng.) 


of Lines 


Set 


General (Heil.) 


Mon. Dec. (Heil.) 


Conv. Dec. (Heil.) 




00: A 






®(;u) 


Fv. V.H. 


Gi: A/P 








Ti'. Area 


G2. A^/P^ 


Unbounded 




®(;u) 



Table 2. Summary of known bounds 



0^™°n)(n) = 0( is given in the unpublished manuscript |S|- It inspired the 

simplified proof given in this paper with the kind permission of Joseph Mitchell. 
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Abstract. One strategy for the enumeration of a class of objects is 
local transformation, in which new objects of the class are produced by 
means of a small modification of a previously-visited object in the same 
class. When local transformation is possible, the operation can be used 
to generate objects of the class via random walks, and as the basis for 
such optimization heuristics as simulated annealing. 

For general simple polygons on fixed point sets, it is still not known 
whether the class of polygons on the set is connected via a constant-size 
local transformation. In this paper, we exhibit a simple local transforma- 
tion for which the classes of (weakly) edge- visible and (weakly) externally 
visible polygons are connected. The latter class is particularly interesting 
as it is the most general polygon class known to be connected under local 
transformation. 

1 Introduction 

Even for small instances, strategies for combinatorial enumeration must be very 
efficient in order to handle the huge numbers of objects which are often produced. 
In particular, they must avoid both the excessive recomputation of individual 
objects, and the excessive computation of objects which are outside the class 
under consideration. One way of limiting the construction of invalid objects is 
through the use of local transformation, in which new objects of the class are 
generated only from previously-examined objects by means of small changes. The 
class must be connected with respect to the local transformation in question: that 
is, every valid object must be reachable from some initial valid object by means 
of a finite sequence of transformations. Local search and optimization methods 
such as reverse search enumeration random walks and simulated annealing 
all make use of local transformation to visit new objects. 

* Partially supported by CUR Gen. Cat. 1999SGR00356 and MEG-DGES-SEUID 

PB98-0933 
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Within the discipline of computational geometry, interest in local transforma- 
tion can be traced back to the investigation of the connectivity of Delaunay tri- 
angulations under the diagonal flip operation PE) . Recent connectivity results 
have been established for triangulations, including triangulations of polygons 
rnn . topological triangulations nn, and triangulations in higher dimensions 
HI. Other classes of objects, such as non-crossing trees and Euclidean matchings 
have also been studied I5ltil . 

One class of great importance to computational geometry, but for which 
no satisfactory enumeration method is yet known, is that of simple polygons. 
Given a set of n points S, one can generate all simple polygons on S by con- 
sidering each of the possible Hamiltonian circuits of S, and then testing 

each for self-intersection. However, this approach is hardly practical, as the pro- 
portion of circuits which are simple diminishes (at least) exponentially with n. 
Zhu, Sundaram, Snoeyink, and Mitchell have provided a method for generating 
x-monotone polygons in 0{n^) time which does not use local transformation; 
unfortunately, there is no obvious way of extending this to other classes of poly- 
gons. 

As in the case of triangulations, if any simple polygon on S could be trans- 
formed into any other by means of a finite sequence of ‘local’ transformations, 
then methods such as reverse search enumeration could be applied. The time 
complexity of the method would depend heavily on the number of edges modi- 
fied by the transformation. 

1.1 Edge Flips 

Let S' be a set of n points not all collinear. If two distinct polygons P and P' on 
S differ in exactly k edges, then we shall say that P is (edge) k-flippable from P 
to P' . Alternatively, we say that P is transformable to P' by means of a k-flip. 
The edges of P not contained in P' are called the leaving edges, and the edges 
of P' not contained in P are called the entering edges. 

Figure 1 gives a characterization of A:-flips for k = 2 and fc = 3. A few special 
cases are worth mentioning: 

— If the vertices of the internal chain of a 2-flip are collinear (or if the chain 
consists of a single edge), the 2-flip is called a line flip (L-flip). 

— If two leaving edges of a 3-flip share a vertex in common, the flip will be 
referred to as a vertex-edge flip (VE-flip). 

— A flip is called planar if no leaving edge intersects any entering edge, except 
perhaps at their endpoints. 

Planar flips are of computational interest, in that the entering edges are 
edges of the visibility graph of P. Visibility graphs of simple polygons can be 
computed in 0{M) time, where M is the number of edges of the graph [Zj. Using 
the visibility graph, all candidate L-flips and planar VE-flips of a given polygon 
can be generated in a straightforward manner, in 0{M) time. 

In 1991, David Avis posed the following question: can any simple polygon 
on S be transformed to any other simple polygon on S by means of a finite 
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2-flip L-flip 




Planar 3-flip Planar 3-flip Non-planar 3-flip 




Planar VE-flip Non-planar VE-flip 




Fig. 1. 2-flips and 3-flips. 




Fig. 2. A non-2-flippable, non-VE-flippable polygon. 



sequence of 2-flips? Unfortunately, the answer to Avis’ question turns out to be 
‘no’ 0. The 19-vertex polygon shown in Figure 2 cannot be 2-flipped or VE- 
flipped to any other polygon on S (although it is 3-flippable) . Arbitrarily large 
non-2-flippable and non-VE-flippable polygons may be exhibited. 

The natural question to ask at this point is, is there any constant k such that 
any polygon P on a given set of vertices S can be transformed into any other 
polygon P' via a finite sequence of fc-flips? For general polygons, the question is 
still very much open. In this paper, we show that two of the simplest operations, 
namely L-flips and planar VE-flips, together are sufficient to connect polygon 
classes with certain visibility properties: (weakly) edge-visible and (weakly) ex- 
ternally visible. These polygon classes are formally defined in the next section, 
although more information concerning visibility within polygons can be found in 
m- In Section El we prove the connectivity of edge- visible polygons under the 
VE-L-flip operation, in which either an L-flip or a planar VE-flip can be per- 
formed. The edge- visibility is taken with respect to a fixed edge. In Section E) we 
prove the connectivity of general edge-visible polygons, and of externally visible 
polygons. 
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2 Polygon Classes 

Let P = {po,pi,p 2 , . ■ . ,Pn-i) be a simple polygon in the plane. With respect 
to P, points s,t G B? are said to be mutually (clearly) visible if the open line 
segment st joining s and t does not intersect P. The visibility is said to be 
internal if st lies in the interior of P, and external otherwise. 

Let e be an edge of P. P is said to be (weakly) edge-visible from e if for every 
point s on P there exists a point t on e that is internally visible from s. One 
variant of edge visibility requires that t be in the relative interior of e - in such 
cases, it is not hard to see that e must be an edge of the convex hull of P. In 
this paper, we will assume that e coincides with an edge of the boundary of the 
convex hull, but will also consider some cases in which visibility to the endpoints 
of e is allowed. 

Algorithms for edge-visible polygons tend to be simpler and more efficient 
than their counterparts for general simple polygons, due to their special visibility 
properties. For example, the most important property for our purposes also 
allows edge- visible polygons to be triangulated in linear time, simply by cutting 
off convex vertices one by one: 

Property 1. |2j Every convex vertex of P other than the endpoints of e is an ear 
of P. 

Closely related to edge-visible polygons are (weakly) externally visible poly- 
gons. A polygon P is (clearly) externally visible if for each point s on the bound- 
ary, there exists a ray ps originating at s which avoids the interior of P, and 
which intersects P only at s. If s is on the boundary of the convex hull of P, this 
condition is trivially satisfied. Otherwise, s must be on some chain of P through 
the interior of the hull, with only its endpoints pi and pj on the hull boundary. 
Together with the lid segment (pi,pj), the chain forms what is referred to as 
a pocket polygon. As any external ray originating inside the pocket must inter- 
sect the lid in order to escape, every pocket of an externally visible polygon is 
edge- visible with respect to its lid (see Figure 3). 

Two well-known classes of polygons, monotone and shar-shaped, are sub- 
classes of the class of externally visible polygons. Although we have established 
connectivity for these classes as well, the details are omitted in this version of 
the paper, due to space limitations. 



3 Edge-Visible Polygons 

Instead of proving directly that a polygon of a given class can be transformed 
into another polygon from the same class, our strategy is to show that it can 
always be transformed into a unique ‘canonical’ polygon K. If this is so, polygon 
P could be transformed into another polygon P' by first transforming P into 
K, and then reversing the operations that transform P' into K . Before showing 
how this can be done for edge-visible polygons, we present a technical lemma 
concerning the visibility of subchains within a polygon. 
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Fig. 3. An externally visible polygon. 



3.1 Clearly Visible Chains 

Let S' be a set of n planar points, not all collinear. Let e be an edge of the convex 
hull boundary of S, and g be a point on e, such that the endpoints of e are the only 
pair of points of S forming a collinearity with q. Let P = (po,Pi,P 2 , • ■ ■ ,Pn-i) 
be an edge- visible polygon on S from e = (poiPn-i), with vertices listed in 
clockwise order about the interior. 

With respect to g, for 0 < j < m — 1, vertex pi is a left cusp if both pi-i 
and pi+i appear after pi in clockwise angular order about g. If instead both 
and pi+i appear before pi, then pi is a right cusp. The notion of a cusp extends 
to polygons which are not edge-visible, and even to polygonal chains, provided 
that adjacent vertices are not collinear with g. 

The following lemma shows the existence of a chain of P that is entirely 
visible from g, and whose endpoints are either endpoints of e or cusps. 

Lemma 1. Let P, e and q be as defined above. Then there exist indices 0 < a < 
b < n such that: 

1. every point of subchain {paTPa+i, ■ ■ ■ ,Pb) is visible from q, with the possible 
exceptions of pa if a = 0, and pt if b = n — 1; 

2. if a > 0, then Pa is a left cusp; 

3. if b < n — 1, then pb is a right cusp. 

Proof. Omitted. 

3.2 'Transformation to a Canonical Polygon 

Let us now consider the polygon spanning the points of S in such a way that 
its vertices appear consecutively in (clockwise) angular order about g. Polygon 
K'^ shall be called the canonical edge-visible polygon on S with respect to g. Note 
that is uniquely defined, and is simple. 

Any edge-visible polygon P on S with visibility to e can be given a score 
depending on the visibility of its vertices from g. Every vertex Si of P contributes 



On Local Transformation of Polygons with Visibility Properties 



59 




Fig. 4. Searching for a cusp behind chain P*, and attempting VE-flip with vertex 
Pc- 



either 0 or 1 to the score: Si contributes 1 if Si is an endpoint of e or if Si is 
visible from g; otherwise, the contribution of Si is 0. Every polygon on S with 
edge-visibility to e can be scored in this way. It is not hard to see that the 
score of the canonical polygon K‘^ is n, and that K‘^ is the only polygon on S 
which achieves this maximum score. Also, the minimum possible score is 3: a 
contribution of 1 from each endpoint of e, and 1 from that vertex of P closest 
to the line containing e (this vertex must be visible from q) . 

Lemma 2. Let S be a set of n planar points, not all collinear. Let e be an edge 
of the convex hull boundary of S , and q be a point on e, such that the endpoints 
of e are the only pair of points of S forming a collinearity with q. Lf P is an 
edge-visible polygon spanning S with visibility to e, then P can be transformed 
into canonical edge-visible polygon K'^ via a sequence of at most n — 3 VE-L-flips. 

Proof Let P be the polygon (po,Pi,P 2 , • • ■ ,Pn-i), with e = {po,Pn-i), and 
vertices listed in clockwise order about the interior. 

It suffices to show that if P yf K'^, there exists an improving VE-flip; that 
is, one which transforms P into another edge-visible polygon P' such that the 
score of P' is greater than that of P. 

Lemma m implies the existence of indices 0<a<b<n— 1 such that 

— a = 0 or Pq is a left cusp, 

— 6 = n — 1 or pb is a right cusp, and 

— P* = (pa,pa+i, . . . ,pb) is entirely visible from q — the only possible excep- 
tions being at Pa if a = 0, or pf, if 6 = n — 1. 

The only situation in which a could equal 0 and b could equal n — 1 is if P 
were the canonical polygon K'^. Since by assumption P K'^, either a > 0 or 
b < n — 1. We will consider only the case where a > 0, as the arguments for the 
case where b < n — 1 are virtually identical. 

Starting from pa, we search for a vertex and edge to participate in a VE-flip, 
by performing a radial sweep clockwise about q. As the sweep ray progresses, the 
closest intersection points (to q) with both P* and P\ P* are maintained, the 
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first intersections being on edge {pa,Pa+i) of P* and edge {pa,Pa-i) of P \ P* . 
Note that the visibility of P* to q implies that the chain is radially monotone 
with respect to g; that is, any ray from q intersects P* in at most one point (see 
Figure 4). The sweep continues until a cusp Pc of P \ P* is encountered. We 
claim that this must occur before the sweep ray reaches pb- 

Since Pc is a cusp, it must be a convex vertex. From the edge- visible property 
of P, it must therefore be an ear. Let (pj,pj+i) be the edge of P* intersected 
by the ray through pc, where a < j < 6. If Pc is visible from both pj and Pj+i, a 
VE-flip is possible: edges (pc-i,Pc), (PcPc+i) and (j>j,pj+i) could be replaced 
by edges (pc-i 7 Pc-i-i), iPcPj) and (pcPj+i)- Due to the radial monotonicity of 
P* with respect to q, the removal of edge (j>j,pj+i) would result in pc becoming 
visible from q. The internal visibility to e and to q of all other vertices would 
remain unchanged; the VE-flip would thus yield an edge- visible polygon of higher 
score than P. However, it is possible that some part of the polygon interferes 
with the visibility from pc to Pj+i (see Figure 4). 

If Pc is not visible from both pj and Pj+i, we search for an alternative to Pc 
for the flip with (pj,pj^i). Let I be the line segment joining q and Pc, and let x 
be a point of I between (pj^pj+i) and Pc- Imagine a second sweep in which the 
point X begins on (pj^pj+i) and slowly moves along I towards Pc- As x moves, 
the empty triangular area bounded by x, pj and Pj+i would grow, until one 
or both of its sides comes into contact with at least one vertex of P\P*. Let us 
assume that contact occurs along the side (x,pj+i) of the triangle; the case in 
which the only contact is along {x,pj) is handled symmetrically. 

For contact to occur along (x,pj+i), there must exist b<d<d'<n— 1 
such that 

— Pi is collinear with {x,pj+i) for all d < i < d' , 

— Pd'+i is not collinear with (x,pj+i), and 

— Pi is not collinear with (x,pj+i) for all j -I- 1 < z < d. 

There are two cases remaining to be considered. 

1. j -t- 2 < d. 

The segment (pj,pd) passes through the interior of Z\^, and thus pj and pd 
are mutually visible. The construction guarantees that pd and Pj+i are also 
visible, and that pd is a convex vertex. Here, a VE-flip is possible: edges 
{pd-ijPd), {pd,Pd+i) and {pj,pj+i) could be replaced by edges {pd-i^Pd+i), 
(pd,Pj) and (j>d,Pj+i)- By the same arguments as when pc was visible from 
Pj and Pj+i, the VE-flip would yield an edge- visible polygon of higher score 
than P (see Figure 5). 

2. j 4- 2 = d. 

The segment (pd',Pj) passes through the interior of A^, and thus pd' and 
Pj are mutually visible. The collinearity of (pj+i,pj+ 2 , . . . ,pd') and the fact 
that Pd’ is a convex vertex together imply that Pd’+i and Pj+i are mutually 
visible. To see this, consider the polygon that would result if the collinear 
chain {pj+i,pj+2, ■ ■ ■ ,Pd') were replaced by the new edge {pj+\,pd')- This 
resulting polygon would also be edge- visible with respect to e, and pd' would 
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Fig. 5. Two cases: no contact with Pj+2', contact with Pj+2- 



be an ear. Here, an L-flip is possible: edges {pd' , Pd' +i) and (pj,pj+i) could be 
replaced by edges {pj,pd') and {pj+i,pd'+i)- With the deletion of (pj,pj+i), 
each of the vertices (j>j+2,Pj+3, ■ ■ ■ ,Pd') would become visible to q without 
disturbing the visibility to q of any other vertices. The L-flip would thus 
yield an edge- visible polygon of higher score than P (see Figure 5). 

It should be observed that as long as no other vertex of P is collinear with 
the endpoints of e, it is always possible to choose some point g on e satisfying 
the non-collinearity condition of the lemma. The main theorem of this section 
follows as an immediate corollary. 

Theorem 1. Let S be a set of n planar points, not all collinear. Let P and P' 
be edge-visible polygons spanning S , with visibility to a common convex hull edge 
e. Then P can be transformed into P' via a sequence of no more than 2n — 6 
VE-L-flips. 

4 Other Polygon Classes 

The methods of the previous section for proving connectivity under the VE-L- 
flip operation of edge-visible polygons can be extended to other classes as well. 
The general strategy is the same: define a score function and a unique canonical 
polygon which maximizes it, and then show that there always exists an improving 
VE-L-flip for every polygon other than the canonical. 

4.1 Externally Visible Polygons 

Let point set S, convex hull edge e, and point g on e be defined as in the previous 
section. Let h be the number of points of S on the boundary of its convex hull. 
The canonical externally visible polygon K‘^ = (ko, ki, , kh-i, kh, ... , kn-i) is 
defined as follows: 

— the endpoints of e are k^ and ku-i] 

— the subchain (A:o, k\, . . . , kh-i) of K'^ spans the points of S on its convex hull 
boundary, in counterclockwise order about the interior; 
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— the subchain (fc/i, kh+i, . . . , fc„_i) of K'^ spans the points of S interior to the 
convex hull; 

— the vertices of subchain {kh-i, • ■ ■ , fco) appear consecutively in clock- 
wise angular order about q. 

Again, is uniquely defined, and is simple 

An externally visible polygon P on S will be scored as follows: vertex Si in 
the interior of the convex hull of S contributes 1 if Si has external visibility to 
q; otherwise, if Si is not externally visible to q or if si is on the convex hull 
boundary, the contribution of si is 0. The score of the canonical polygon 
is n — h, and no other externally visible polygon on S achieves this maximum 
score. The minimum possible score is 0, occurring whenever e is an edge of P. 
The following lemma is a corollary of Lemma El 

Lemma 3. Let S be a set of n planar points, not all collinear. Let e he an edge 
of the convex hull boundary of S, and q be a point on e, such that no pair of 
points form a collinearity with q unless they are both collinear with e. Lf P is an 
externally visible polygon spanning S , then P can he transformed into canonical 
externally visible polygon K‘^ via a sequence of at most n — h VE-L-flips. 

Proof. The details have been omitted, as the proof is virtually identical to that 
of Lemma 0 

Lemma 0 immediately implies the connectivity result for externally visible 
polygons. 

Theorem 2. Let S be a set of n planar points, not all collinear. Let P and P' 
be externally visible polygons spanning S. Then P can be transformed into P' 
via a sequence of no more than 2n — 2h VE-L-flips. 

4.2 Edge- Visible Polygons with Differing Visibility Edges 

Theorem 3. Let S be a set of n planar points, and let S* be the subset of S 
lying on the boundary of the convex hull of S . Assume that no point s of S* is 
collinear with any pair of points of S\{s}. Let P and P' he edge-visible polygons 
spanning S, with visibility to convex hull edges e and e! , respectively. Then P can 
be transformed into P' via a sequence of no more than |" {n — 3) VE-L-flips, 
where h is the cardinality of S* . 

Proof. Consider the shorter of the two sequences of edges between e and e' 
along the boundary of the convex hull of S. If both sequences have the same 
number of edges, then choose one arbitrarily. Let the chosen sequence be if = 
(ei, 62 , ... , Cm), where ei = e and Cm = e'. Let the endpoints of Cj be (si-i, Si) 
for all 1 < i < TO. 

To transform P into P' , we first transform P to the canonical edge-visible 
polygon iC^i with respect to a point qi on ei very close to si: qi is chosen so 
that no line passing through a pair of points of S' \ {sq, si} separates it from si. 
This can be done via a sequence of at most n — 3 VE-L-flips, by Lemma El 
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Since si is assumed not to be involved in any collinearity among the points 
of S, there exists a point q on 62 from which no line of collinearity separates qi. 
This implies that q and qi are both visible from every point on the boundary 
of which in turn implies that K‘^ = is edge- visible with respect to 62- 

can then be transformed into a new canonical edge- visible polygon for 
some point arbitrarily close to S2, again by Lemma 

This process may be continued, choosing points qi on sufficiently close to 
Si for all 1 < f < TO. The result, after to — 1 applications of Lemma | 2 l is a 
polygon edge- visible with respect to e' = Cm- A final application of Lemma El 
shows that this polygon can then be transformed to P'. 

The total number of flips required is bounded by m(ji—S). Since E is chosen to 
be the smaller of the two sequences available, it is easy to show that to < [ . 
From this, the result follows. 

References 

1 . D. Avis and K. Fukuda, ‘Reverse search for enumeration,’ Discr. Appl. Math. 
65:21-46, 1996. 

2. D. Avis and G. T. Toussaint, ‘An optimal algorithm for determining the visibility 
of a polygon from an edge,’ IEEE Trans. Comput. 0-30:910-914, 1981. 

3. H. Edelsbrunner and N. R. Shah, ‘Incremental topological flipping works for 
regular triangulations,’ Algorithmica 15:223-241, 1996. 

4. S. Fortune, ‘Voronoi diagrams and Delaunay triangulations,’ in Computing in 
Euclidean Geometry, D. Z. Du and F. K. Hwang, eds., World Scientific, pp. 
193-234, 1992. 

5. C. Hernando, F. Hurtado, A. Marquez, M. Mora and M. Noy, ‘Geometric tree 
graphs of points in convex position,’ Discr. Appl. Math. 93:51-66, 1999. 

6. G. Hernando, F. Hurtado and M. Noy, ‘Graphs of non-crossing matchings,’ Proc. 
15th European Conference on Computational Geometry, pp. 97-100, 1999. 

7. J. Hershberger, ‘Finding the visibility graph of a simple polygon in time propor- 
tional to its size,’ Algorithmica 4:141-155, 1989. 

8. M. E. Houle, ‘On local transformations of simple polygons,’ Australian Comp. 
Sci. Commun. 18(3):64-71 (Proc. CATS’96, Melbourne), 1996. 

9. F. Hurtado and M. Noy, ‘Graphs of triangulations of a convex polygon and tree 
of triangulations,’ Comput. Geom. Theory Appl. 13:179-188, 1999. 

10. F. Hurtado, M. Noy and J. Urrutia, ‘Flipping edges in triangulations,’ Diser. 
Comp. Geom. 22:333-346, 1999. 

11. B. Joe, ‘Construction of three-dimensional Delaunay triangulations using local 
transformations’. Computer Aided Geom. Design 8:123-142, 1991. 

12. C. L. Lawson, ‘Transforming triangulations,’ Discr. Math. 3:365-372, 1972. 

13. J. O’Rourke, Art Gallery Theorems and Algorithms, Oxford University Press, 
New York, 1987. 

14. M. Pocchiola and G. Vegter, ‘Computing the visibility graph via pseudo- 
triangulations’ Proe. 11th ACM Symp. Comp. Geom., 1995, pp. 248-257. 

15. C. Zhu, G. Sundaram, J. Snoeyink, and J. S. B. Mitchell, ‘Generating random 
polygons with given vertices,’ Comput. Geom. Theory Appl. 6:277-290, 1996. 



Embedding Problems for Paths with Direction 
Constrained Edges* 



Giuseppe Di Battista, Giuseppe Liotta^, Anna Lubiw^, and Sue Whitesides^ 

^ Dipartimento di Informatica ed Automazione, Universita di Roma Tre, Roma, Italy. 

gdbOdia . uniromaS . it 

^ Dipartimento di Ingegneria Elettronica e dell’Informazione, 

Universita di Perugia, Perugia, Italy. 

liottaSdiei . unipg . it 
® Department of Computer Science, 

University of Waterloo, Waterloo, Canada. 
alubiwOdaisy . uwaterloo . ca 
^ School of Computer Science, 

McGill University,Montreal, Canada. 
sue@cs . mcgill . ca 



Abstract. We determine the reachability properties of the embeddings 
in of a directed path, in the graph theoretic sense, whose edges have 
each been assigned a desired direction (East, West, North, South, Up, 
or Down) but no length. We ask which points of R^ can be reached by 
the terminus of an embedding of such a path, by choosing appropriate 
positive lengths for the edges, if the embedded path starts at the origin, 
does not intersect itself, and respects the directions assigned to its edges. 
This problem arises in the context of extending planar graph embedding 
techniques and VLSI rectilinear layout techniques from 2D to 3D. We 
give combinatorial characterizations of reachability that yield linear time 
recognition and layout algorithms. 



1 Introduction 

The shape of a directed, polygonal curve in 3D that consists of n axis-parallel 
straight line segments is the sequence cr of n direction labels East, West, North, 
South, Up, or Down determined by the directions of the n segments of the curve. 

A shape for a directed, graph theoretic path P is an ordered sequence cr 
of n labels East, West, North, South, Up, or Down, one for each of the n 
edges of P. Each label in the sequence specifies a direction for the corresponding 
directed edge of P when that edge is realized as a straight line segment in 3D. 
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A realization of a graph theoretic directed path P as an orthogonal polygonal 
curve in 3Z3 is specified by giving a start point, a shape cr, and an assignment of 
positive lengths to the edges. 

Given a point p in 3D and a shape a for a directed, graph theoretic path P, 
the 3D shape path reachability problem is to find an assignment (if one exists) of 
positive integer lengths to the directed edges of P so that the resulting realization 
of P as an orthogonal, polygonal curve 

— ends at p if it starts at the origin; 

— is simple, that is, does not intersect itself; and 

— satisfies the direction constraints on its segments as specified by cr. 

The 3D shape path reachability problem has instances that do not admit 
solution. For example, consider a point p that lies East of the origin and the 
shape cr given by the sequence of labels UWDESWN, where U stands for Up, 
W stands for West, and so on. Shape a cannot be the shape of any simple, or- 
thogonal, polygonal realization of a path that starts at the origin and terminates 
at p, even though cr contains an E label (see Figure^. 




Fig. 1. No path of shape a = UWDESW N can start at the origin O and reach 
a point p East of O. 



The main result of this paper is a combinatorial characterization of those 3D 
shape path reachability problem instances that admit solution, together with 
linear time recognition and embedding algorithms based on this characterization. 

Various 2D versions of our 3D shape path problem have been studied in 
several papers, including uniioj. The problem has also been generalized to non- 
orthogonal polygons and graphs in HHHIZI 

A basic technique for 2D orthogonal layout in VLSI and in graph drawing 
is to give a “shape” for a graph, that is, an assignment of direction labels E, 
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W, N, and S to the edges, then to determine lengths for the edges so that the 
layout is non-crossing [Vijayan and Widgerson] ( m)- The graph must have 
maximum degree 4. In the VLSI context, each vertex represents either a corner 
of a bounding box containing components, or a connection pin on the side of 
such a box. The edges at a vertex connect it to its two neighbors (pins or box 
corners) on the bounding box and, in the case of a pin vertex, to a pin on another 
box. 

The well-known topology-shape-metrics approach 0 for constructing a recti- 
linear embedding of a planar graph consists of three main steps, called planariza- 
tion, orthogonalization, and compaction. The planarization step determines an 
embedding, i.e., the face cycles, for the graph in the plane. The orthogonaliza- 
tion step then specifies for each edge {u, v), an ordered list of the bends that the 
orthogonal polygonal line representing {u,v) is to have in the final embedding. 
For example, {u, v) could be labeled NESNE, which would say “starting from 
u first go North, then go East, etc.” Finally, the compaction step computes the 
final embedding, giving coordinates to vertices and bends. 

While the literature on 3D orthogonal embeddings of graphs is quite rich (see 
e.g. the extension of the topology-shape-metrics approach to 

3D remains, as far as we know, to be carried out. The 3D shape path reachability 
results we present here are an essential prerequisite for such a program. 



2 Overview of the Main Results 

In order to state our characterization result precisely, we introduce the concepts 
of a flat and of a canonical sequence in a shape. 

Let (7 be a shape. A flat of cr is a consecutive subsequence cr' of a that is 
maximal with respect to the property that its labels come from the union of two 
oppositely directed pairs of directions, i.e., either from the set {N, S, E, W}, or 
from the set {N, S, U, D}, or from the set {U, D, E, W}. Thus any realization of 
the shape sequence a' must consist of segments that lie on the same axis-aligned 
plane. For example, the shape a = UWDESWN depicted in Figure ^contains 
two flats: El = UWDE, and E 2 = ESWN. Observe that two consecutive flats 
of a share a label and that they must be drawn on perpendicular planes. For 
example, the last label, E, of Fi coincides with the first label, E, of F 2 . 

A not necessarily consecutive subsequence r C cr, where t consists of k 
elements, is a canonical sequence provided that: 

— 1 < fc < 3; 

— any two elements of r indicate mutually orthogonal directions; and 

— the elements of r that appear on any given flat of a form a possibly empty 

consecutive subsequence of cr. 

The type of a canonical sequence r is given by the (unordered) set of labels 
it contains. 

For example, the shape cr = UWDESWN depicted in Figure IDcontains two 
canonical sequences of type {U,N,W}. The first two elements of cr are a U and a 
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W. These elements belong to the same flat, Fi, where they appear consecutively; 
they do not belong to F 2 . The last element of cr is an N, which belongs to flat 
F 2 but not to Fi . Thus the conditions for a canonical sequence of length fc = 3 
and type {U,N,W} are satisfled by the subsequence of a consisting of its first, 
second, and last elements. The first, next-to-last, and last elements of a form a 
second canonical sequence of type {U,N,W}. Note, however, that cr does not 
contain a canonical subsequence of type {U, E, N}: these labels occur in unique 
positions in a, and in particular, the unique U and the unique E both belong to 
the same flat Fi, but they are not consecutive elements of a. 

The essence of our combinatorial characterization of solvable shape path 
reachability problems is given below. For concreteness, the characterization is 
given with respect to the UNE octant. As explained in the next section, the 
results for other octants can be obtained by a suitable permutation of the labels. 

Theorem 1. Let a be a shape and let p be a point in the UNE octant. Then 
there exists a simple, orthogonal, polygonal curve of shape a that starts at the 
origin and that terminates at p if and only if a contains a canonical sequence of 
type {U,N,E}. 

In other words, p can be reached, starting from the origin, by a simple polyg- 
onal curve of shape a if and only if it is possible to choose from a & U element, 
an N element and an E element, not necessarily in that order, such that if any 
flat of (7 contains two of these elements, then the two elements are consecutive 
in a. 

3 Preliminaries 

We regard each coordinate axis in the standard 31? coordinate system as con- 
sisting of the origin plus two open semi-axes, directed away from the origin and 
labelled with a pair of opposite direction labels from the set {N,W,S,E,U,D}. 
A triple XYZ oi distinct unordered labels no two of which are opposite defines 
the XYZ octant. Note that unless stated otherwise, we consider octants to be 
open sets. Similarly, a pair XY of distinct orthogonal labels defines the XY 
quadrant in 2D or 31?. Finally, a label X defines the X semi-axis, which consists 
of those points that are positive multiples of the unit vector in the X direction. 
For short we call a semi-axis an axis. Thus, 3Z? space is partitioned into eight 
octants, twelve quadrants, six axes, and the origin. 

Let (T be a shape consisting of n elements or labelfl An embedding or drawing 
of a, denoted as F{a), is a non-self-intersecting, directed, orthogonal polygonal 
curve consisting of n segments such that the A:-th segment {k = l,...,n) of 
F{a) has positive length and has the orientation specified by the fc-th label of 
cr. Unless stated otherwise, we assume that the start of the directed curve F{a) 
lies at the origin of the reference system. 

^ Since the elements of a shape sequence have direction labels as values, we often refer 
to the elements as labels. 
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Assumptions. Since we are interested in shapes that admit embeddings, we as- 
sume from now on that shapes do not contain adjacent labels that are oppositely 
directed. Furthermore, we assume that shapes do not contain adjacent labels that 
are identical, since the reachability properties of shapes with adjacent, identical 
labels are not changed by coalescing such labels. Finally, we assume that the 
point p to be reached in an instance of the shape path reachability problem lies 
in an octant. 

Remark. Let (f>{) be a permutation of the six direction labels that maps opposite 
pairs of labels to possibly different opposite pairs (for example, <j) might map 
N, S, E, W, U, D to E, W, N, S, D, U, respectively). Note that 4>Q defines a linear 
transformation of 3D space that is not necessarily a rotation. Nevertheless, this 
transformation defines a bijection between embeddings of cr and embeddings of 
(j){a). Here, (j){a) denotes the sequence of labels obtained by applying (j) to the 
labels of ct. For concreteness, we often state our results and proofs referring 
to some given octant, quadrant, or axis where points of embeddings of cr can 
lie. However, the results can also be stated with respect to any other octant, 
quadrant, or axis since they are preserved under the ^() transformation. 

Definition. A drawing E{a) of a shape a is an expanding drawing if each segment 
travels one unit farther in its direction than the extreme points, with respect to 
that direction, of the previous segments of F(cr). 

Expanding drawings are useful because a new segment of arbitrary length, 
perpendicular to the last segment of the drawing, may be appended to the last 
segment without causing any collisions. The definition above is a simple, tech- 
nical way to achieve this property. 




Fig. 2. The expanding drawing for cr = ENWSEDNUNDSWUNUED. The 
drawing starts at the origin, and each segment travels one unit farther in its 
direction than any preceding segment has gone. 
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Lemma 1. Every shape admits an expanding drawing. 

In the next sections we shall characterize the reachabiity properties of shapes 
by means of canonical sequences. 



4 The Shape Path Reachability Problem 

We say that a drawing E{a) of a shape a reaehes point p if E{a) terminates at 
p when it starts at the origin. A shape a reaehes a point p if it admits a drawing 
r(cr) that reaches p. A shape a reaehes a set of points if it reaches each point 
in the set. For example, shapes NEU, NUE, and NWUE all reach the UNE 
octant; that is, for each point of the UNE octant, each of these shapes has a 
drawing that terminates at that point. 

Lemma 2. Let a he a shape. If a reaches a point p in an axis, quadrant or 
octant, then it reaches that entire axis, quadrant or octant, respectively. 

Observe that if a shape reaches the UNE octant, then it must contain a 
U label, an N label and an E label. Notice, however, that the converse is not 
always true. As previously observed, the shape UWDESWN does not reach the 
UNE octant, even though it contains a U label, an E label, and an N label. 

In the rest of this section, we investigate the problem of determining whether 
a shape can reach a given portion of 2D or 3Z? space. We distinguish between 
shapes that consists of only one flat and shapes that have more than one flat. A 
shape of the first type is a 2D shape, while a shape of the second type is a 3D 
shape. 



4.1 Reachability in 2D 

In this section we answer the following question: Given a point p in a quadrant 
or on an axis and a 2D shape cr, can a reach pi By Lemma |3 this question 
can be answered by characterizing when cr can reach the quadrant or the axis 
containing p. 

We start by investigating a special type of reachability. Let E{a) be a drawing 
of cr that reaches the NE quadrant. We say that E{a) reaches the NE quadrant 
for the first time with label A if A G {N, S, E,W} is the direction associated 
with the first segment of E(a) entering the NE quadrant when walking along 
E(a) starting at the origin. 

Lemma 3. Let E{a) he a drawing of a 2D shape a = a'E such that E(a) 
reaches the NE quadrant for the first time with its last label, E. Then a contains 
a canonical sequence NE. 



Lemma 4. Let a = a' a" be a 2D shape such that a' reaches the NE quadrant. 
Then cr reaches the NE quadrant. 
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We are now ready to consider the more general problem of quadrant reach- 
ability. 

Theorem 2. A 2D shape a reaches the NE quadrant if and only if a contains 
a canonical sequence of type {N,E}. 

Proof. We prove first that if a contains a canonical sequence r of type {N,E}, 
then it reaches the NE quadrant. Let a — a\NEa 2 , where a\ or U 2 may be the 
empty string. By Lemma QJ a\ has an expanding drawing Extend r{ai) 

to a drawing for a as follows. Append to the terminal point of T(cti) a segment 
oriented N that is so long that i) its terminal point projects to the N axis and ii) 
the resulting drawing is still an expanding drawing. Because the new drawing is 
expanding we can append to its terminal point a segment oriented E that is long 
enough to enter the NE quadrant. Thus a\NE C a reaches the NE quadrant, 
and hence by Lemma 0 so does a. 

Suppose now that cr = Xi, . . . reaches the NE quadrant, and let E(a) 
be a drawing of cr that terminates at a point in the NE quadrant. Suppose the 
first segment of E{a) to intersect the NE quadrant is the segment, whose 
associated direction is Xi. Let cr' denote the initial sequence Xi,...Xi of cr. 
Then P{a) contains a drawing E{(j') of a' . If Xi = E, then by Lemma 0, a' and 
hence cr contains the consecutive subsequence NE. Similarly, if Xi = N, then cr 
contains the consecutive subsequence EN. 

Based on the results above and on Lemma |3 it is possible to design a lin- 
ear time algorithm for deciding whether a 2D shape cr can reach a point of a 
quadrant. 

Theorem 3. Let a be a 2D shape, and let p he a point of a quadrant. There 
exists a linear time and space algorithm that decides whether a reaches p. 

4.2 Reachability in 3D 

In this section we answer the following question: Given a point p in an octant and 
a 3D shape cr, can cr reach pi Again, by Lemma El this question can answered 
by characterizing when cr can reach a given octant. To this aim, we introduce 
the key concept of a doubly expanding drawing. This concept will later be used 
for the study of shape path embeddability. 

A doubly expanding drawing is one for which the first and last segments can be 
replaced by arbitrarily long segments without creating any intersections within 
the drawing. Figure 0 shows an example of a doubly expanding drawing. 

Lemma 5. Let a be a shape with n labels such that it either consists of exactly 
two labels or it contains at least two flats. Then a has a double expanding drawing 
that can be computed in 0{n) time. 

The next lemma gives a sufficient condition for octant reachability; the proof 
(omitted due to space constraints) is based on connecting together singly and 
doubly expanding drawings for subsequences of shapes and gives rise to a linear 
time embedding algorithm. 
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* Up 



North 




Fig. 3. A doubly expanding drawing for a = WNENESWNUNUSWUNEDS. 



Lemma 6. Let a be a 3D shape that contains a canonical sequence of type 
{U,N,E}. Then a reaches the UNE octant. 

The next lemma proves that the condition of Lemma El is also necessary for 
octant reachability. 

Lemma 7. Let a be a 3D shape that reaches the UNE octant. Then a contains 
a canonical sequence of type {U, N, E}. 

Proof. First, we begin by proving a weaker form of this: 

If cr is a 3D shape that can reach the NE quarter-space (i.e., the union of the 
UNE octant, the DNE octant, and the NE quadrant), then one can choose 
one N and one D in cr that either are consecutive or do not share a flat. 

To prove this, consider a drawing D(cr) that reaches the (open) NE quarter- 
space. Let (a, b) be the first edge of T that enters this quarter-space. Suppose 
without loss of generality that (a, b) is an E edge. Since this E edge lies in the N 
half-space, there must be a previous N edge, (c, d), that enters this half-space. 
If the chosen E and N edges do not share a flat, we are done. So assume that 
they do share a flat. Let LL be the NSEW plane containing the flat. Note that 
b is in the (open) NE quadrant of LI. We claim that c is in the closure SW of 
the W quadrant. This is because d, which is in the N half-plane but not in the 
NE quadrant, must lie in the NW quadrant. The portion of T from c to b thus 
travels to the NE quadrant. By the Lemma Elone can And a consecutive pair 
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NE or EN in the corresponding portion of a. This completes the proof of the 
preliminary result. 

To obtain a proof of the original statement, consider a drawing E{a) that 
reaches the ENU octant. Let {p, q) be the first edge of E that enters this octant, 
and suppose without loss of generality that {p, q) is a. U edge. Since this U edge 
lives in the NE quarter-space, there must be a previous edge, (r, s), that enters 
this quarter-space. Assume without loss of generality that (r, s) is an N edge. 
Note that q is in the ENU octant, s is in the END octant, and r is in the ESD 
octant. 

The path from the origin to s travels to the EN quarter-space, and thus by 
the preliminary result just proved, one can choose in the corresponding portion 
of cr an if and an N that either are consecutive or do not share a flat. The path 
from r to q travels to the NU quarter-space, and thus by the preliminary result 
above, one can choose in the corresponding portion of cr an iV and a U that are 
either consecutive or do not share a flat. Note that the second pair of chosen 
labels follows the first pair in a, except that the N edge (r, s) may be in both 
pairs. If it is, we have what we want. 

Now consider the other possibilities. The chosen letters may appear in the 
order EN1N2U, with the N edge (r, s) in the closed interval from A^i to N2. We 
claim that either A^i and U do not share a flat, or E and N2 do not share a flat, 
for if both occurred then all four letters would share a flat. Thus at least one of 
A^i or N2 will sufflce to establish the lemma. 

As a next possibility, the chosen letters may appear in the order N1EN2U, 
with the N edge (r, s) in the closed interval from E to N2- E and U cannot share 
a flat, because such a flat would also have to contain N2, and thus Ni suffices. 
The case EN1UN2 is similar — N2 suffices. 

The final possibility is that the chosen letters may appear in the order 
N1EUN2 with the N edge (r, s) in the interval from E to U. Because the N 
edge intervenes, E and U cannot share a flat, and thus either of A^i or N2 will 
suffice. 

Lemmas 0 and Q can be summarized as follows. 

Theorem 4. Let a be a 3D shape. Shape a reaehes the UNE oetant if and only 
if it eontains a eanonieal sequenee of type {U, N, E}. 

Theorem0and Lemma|2lyield a linear time algorithm for deciding whether a 
3D shape a can reach a point in an octant. Given a point p that can be reached, 
the linear time algorithm in the proof of Lemma|H|then gives an embedding that 
terminates at p. 

Theorem 5. Let a he a 3D shape with n labels, and let p be a point of an 
oetant. There exists an algorithm that decides whether a reaehes p, that runs in 
0 {n) time, and that computes an embedding for a that reaches p when such an 
embedding exists. 
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5 Open Problems 

Several issues remain open. We mention three that in our opinion are interesting. 

1. Study the reachability problem under the additional constraint that the 
lengths of some of the segments are given. 

2. Study the reachability problem in the presence of obstacles. 

3. Study the reachability problem for paths that must be embedded in a fixed 
grid and that are to reach specified points in this grid. 

In the case of the last item, it is no longer true that a shape can reach a 
point if and only if it can reach all the other points in the same octant. One can 
no longer construct a drawing and then scale it so that its terminus is located 
at some specified point p, as scaling would move some edges off grid lines. For 
drawings in a fixed grid, volume becomes a meaningful issue to study. 
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Abstract. In this paper we give a characterization of level planar graphs 
in terms of minimal forbidden subgraphs called minimal level non-planar 
subgraph patterns (MLNP). We show that a MLNP is completely char- 
acterized by either a tree, a level non-planar cycle or a level planar cycle 
with certain path augmentations. These characterizations are an impor- 
tant first step towards attacking the A/”P-hard level planarization prob- 
lem. 



1 Introduction 

Level graphs are an important class of graphs that generalize bipartite graphs. 
Such graphs are used to model hierarchical relationships or workflow diagrams. 
From the point of view of representing the relationships graphically, it is desirable 
that the graphs are drawn with as few edge crossings as possible. Jiinger et al. 0 
have presented a linear-time algorithm to determine if a level graph is planar; 
this algorithm can be modified to determine a planar embedding of the graph 
as well. 

Clearly, level graphs that need to be visualized are not level planar in general. 
Mutzel [H] therefore studies the level planarization problem for the case k = 2 
levels, removing a minimum number of edges such that the resulting subgraph 
is level planar. For the final diagram the removed edges are reinserted into a 
level planar drawing. However, the level planarization problem is AfT^-hard jS|. 
Mutzel 1^ gives an integer linear programming formulation for the 2-level pla- 
narization problem, studying the polytope associated with the set of all level 
planar subgraphs of a level graph with 2 levels. 

In order to attack the level planarization problem for k > 2 levels, an integer 
linear programming formulation has to be found, and the polytope associated 
with the set of all level planar subgraphs of a level graph needs to be described. 
Besides, polynomial time separation algorithms need to be developed for prac- 
tical application. One important step in the study of the polytope associated 
with the set of all level planar subgraphs is the characterization of level planar 
graphs in terms of minimal forbidden subgraphs called minimal level non-planar 
subgraph patterns (MLNP-patterns) These graphs are the analogue of and 
K 3 3 in the case of general graph planarity. 

* Funded by Forbairt Basic Research Grant SC/97/611. 



D.-Z. Du et al. (Eds.): COCOON 2000, LNCS 1858, pp. 74-im 2000. 
(c) Springer- Verlag Berlin Heidelberg 2000 



Characterization of Level Non-planar Graphs by Minimal Patterns 



75 



This paper is organized as follows. After summarizing the necessary prelim- 
inaries in the next section, we introduce the minimal level non-planar subgraph 
patterns in Sect.0. We prove that the patterns are minimal level non-planar and 
that the set of patterns is complete for the special case of hierarchies. This result 
is generalized to level graphs in Sect.0 

2 Preliminaries 

A level graph G = (V, E, <f) is a directed acyclic graph with a mapping (j) : V ^ 
{1,2,..., k}, k > 1, that partitions the vertex set F as V = Lh U V 2 U • • • U 14, 
Vj = Vi r\Vj = 0 for i ^ j, such that (j>{v) = (j){u) + 1 for each edge 

(u, u) e E. 

A drawing of a level graph G in the plane is a level drawing if the vertices 
of every Vj, 1 < j < A:, are placed on a horizontal line Ij = {(x, k — j) | x G IR}, 
and every edge (u,v) G E, u G Vj, v G Vj+\, 1 < j < fc, is drawn as a straight 
line segment between the lines U and h+i- A level drawing of G is called level 
planar if no two edges cross except at common endpoints. A level graph is level 
planar if it has a level planar drawing. 

A hierarchy is a level graph G(V, E, (p) where for every v G Vj, j > 1, there 
exists at least one edge {w,v) such that w G Vj-i. That is, all sources appear 
on the first level. 

The characterization of level non-planarity by patterns of subgraphs has 
been suggested earlier by Di Battista and Nardelli who have identified three 
(not necessarily minimal) patterns of level non-planar subgraphs for hierar- 
chies pp. We call these patterns Level Non- Planar (LNP) patterns. To describe 
the LNP patterns, we give some terminology similar to theirs. A path is an 
ordered sequence of vertices (ui,t> 2 ,... ,u„), n > 1 such that for each pair 
(ui,Ui+i), i = 1,2,... ,n — 1 either (ui,Ui+i) or (vi+i,Vi) belongs to E. Let 
i and j, i < j he two levels of a level graph G = {V,E, 4 >). LACE(j,j) denotes 
the set of paths C connecting any two vertices x G Vi and y G Vj such that 
{z G C \ z G Vt, i < t < j}. li G\ and C2 are completely distinct paths be- 
longing to LACE(f,j) then a bridge is a path connecting vertices x G G\ and 
y G G2- Vertices x and y are thus called the endpoints of a bridge. The next 
theorem gives a characterization of LNP patterns for hierarchies, as opposed to 
level graphs. 

Theorem 1 (Di Battista and Nardelli). Let G = {V,E,(j>) he a hierarchy 
with k > 1 levels. G is level planar if and only if there is no triple L\,L2,La G 
LACE(i, j), 0 < i < j < k, that satisfies one of the following conditions: 

(a) Li, L2 and L3 are completely disjoint and pairwise connected by bridges. 
Bridges do not share a vertex with L\, L2 and L\, except in their endpoints 
(see Fig. W(a)); 

(b) Li and L2 share an endpoint p and a path G (possibly empty) starting from 
p, Li n L3 = L2 n L3 = 0; there is a bridge b\ between L\ and L3 and a 
bridge 62 between L2 and L3, 61 n L2 = A*2 H Li = 0 (see Fig.njfh)); 
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(c) Li and L2 share an endpoint p and a path Ci (possibly empty) starting from 
p; Li and L3 share an endpoint q {q ^ p) and a path C2 (possibly empty) 
starting from q, C2C\C\ = 0; L2 and L3 are connected by a bridge b, bC\Li = 0 
(see Fig. W(c))- 






(c) 



Fig. 1. Di Battista-Nardelli’s Level Non-Planar Patterns 



3 Minimal Level Non-planar Patterns in Hierarchies 

Di Battista and Nardelli, as we have remarked, have identified three level non- 
planar patterns for hierarchies. From the recognition of an LNP pattern in a 
hierarchy, one can decide that the hierarchy is not level planar, but one cannot 
guarantee that a removal of an edge from the subgraph matching an LNP pattern 
leads to the level planarity of the graph. 

MLNP patterns are defined to have the following property: If a level graph 
G = (V,E,cj)) matches an MLNP pattern then any subgraph G' {V,E' ,()>) of G, 
with E' = E \ {e}, e G E, is embeddable without crossings on levels. The MLNP 
patterns are divided into three categories: trees, level non-planar cycles, and level 
planar cycles with incident paths. We give a comprehensive description of each 
of these categories and show that the categories are complete for hierarchies. 

The terminology that is used to describe the MLNP patterns is compatible 
with Harary except that we denote by a chain a tree T{V,E) where E = 
{(^1,^2), (^2,^3), . . . , (u|y|_i, U|y|)}. Furthermore, we define some terms that are 
common to all of the patterns. The upper- and lower-most levels that contain 
vertices of a pattern P are called extreme levels of P. The extreme levels of a 
pattern are not necessarily the same as the extreme levels 1 and k of the input 
graph G. If pattern P is instantiated in G as subgraph Gp and the uppermost 
level of Gp in G is i and the lower most level is j {i < j) then the extreme levels 
of P correspond to levels i and j in G. If vertex v lies on an extreme level then 
we call this extreme level the incident extreme level and the other extreme level 
the opposite extreme level of v. 
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3.1 Trees 

Characterization We can characterize a MLNP tree pattern as follows. Let i and 
j be the extreme levels of a pattern and let x denote a root vertex with degree 3 
that is located on one of the levels i, . . . ,j. From the root vertex emerge 3 sub- 
trees that have the following common properties (see Fig. for an illustration): 

— each subtree has at least one vertex on both extreme levels; 

— a subtree is either a chain or it has two branches which are chains; 

— all the leaf vertices of the subtrees are located on the extreme levels, and if 
there is a leaf vertex ti of a subtree S on an extreme level I S {i,j} then v is 
the only vertex of S on the extreme level /; 

— those subtrees which are chains have one or more non-leaf vertices on the 
extreme level opposite to the level of their leaf vertices. 

The location of the root vertex distinguishes the two characterizations. 

T1 The root vertex x is on an extreme level I G {i,j} (see Fig. 0(a)): 

— at least one of the subtrees is a chain starting from x, going to the 
opposite extreme level of x and finishing on x’’s level; 

T2 The root vertex x is on one of the intermediate levels I, i < I < j (see 
Fig.EKb)): 

— at least one of the subtrees is a chain that starts from the root vertex, 
goes to the extreme level i and finishes on the extreme level j ; 

— at least one of the subtrees is a chain that starts from the root vertex, 
goes to the extreme level j and finishes on the extreme level i. 





(a) 



(b) 



Fig. 2. Minimal Level Non-Planar Trees 



Theorem 2. A subgraph matching either of the two tree characterizations T1 
or T2 is minimal level non-planar. 

Proof. The proof of level non-planarity oi T1 and T2 is straightforward by 
matching T1 and T2 to the LNP pattern (a). 

To prove minimality, we consider the two forms of the tree patterns sep- 
arately. Consider T1 where the root vertex x and vertices a, v and w of the 
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chains are located on the same extreme level. Every subtree detached from the 
root- vertex x has a level planar layout. Thus if we remove one of the edges in- 
cident upon the leaf vertices on the extreme level of the root vertex (like the 
vertices v or w in Fig. 13a)) then the corresponding subtree can be embedded 
under the root vertex x and between the other subtrees without any crossings. 
If we remove an edge incident upon the leaf vertices near the opposite level of 
the root vertex (for example, the path from vertex c to the branching point 
in Fig. Ha)) then the modified subtree can be embedded on top of the chain- 
shaped subtree (according to the characterization there has to be one). Next, if 
we remove any other edge, we will have two disconnected subgraphs: one which 
contains the root vertex and the other which does not contain the root vertex. 
The former is a reduced case of the removal of an edge incident to a leaf vertex 
and the other component can be embedded. 

In the case T2 when the root vertex is not on an extreme level, we consider 
two cases: the removal of an edge connecting the leaf vertex of a chain and the 
removal of an edge connecting a leaf vertex of a non-chain subtree. In the former 
case, the two chain subtrees can be embedded on top of each other. In the latter 
case, the path can be embedded under or on top of a chain by repositioning 
either vertices v or u as appropriate. If we remove any other edge then, again, 
we will have two disconnected subgraphs from which the subgraph containing 
the root vertex is a reduced case of the removal of an edge incident to a leaf 
vertex and the other subgraph can be embedded. 

The following three lemmas (stated without proof) and theorem prove that 
the two tree patterns in our characterization are unique. 

Lemma 1. If LNP pattern (a) matches a tree then each one of the paths L\, 
L 2 , L 3 contains only one vertex where the bridges are connected. 



Lemma 2. If LNP pattern (a) matches a tree then its bridges must form a 
subgraph homeomorphic to 



Lemma 3. Only LNP pattern (a) can be matched to a tree. 



Theorem 3. Let T be a tree. T is level non-planar if and only if it matches 
either of the two tree characterizations. 

Proof. From the previous lemmas it is possible to derive a level non-planar tree 
pattern (not necessarily minimal) from LNP pattern (a) only. Consider a tree 
matching pattern (a). If the pattern is bounded by levels i and j, but the vertices 
of bridges occur on levels . . . ,^ 2 > where i < h and h < j then we can re- 
move all the edges of the paths Li which connect vertices on levels i, . . . ,li and 
I 2 , . . ■ , j without affecting level planarity. Moreover we can narrow the range of 
levels l\, . . . ,l 2 even more, until both levels h and I 2 contain at least one vertex 
V whose degree in the subgraph bounded by levels h and 1 2 is greater than 1. 
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Then, it can be shown that the tree between levels /i, . . . , ^2 is homeomorphic to 
one of the MLNP trees. From Lemmas 0 and |2I each of the paths Li has exactly 
one vertex Ci to connect a bridge and the bridges form a subgraph homeomor- 
phic to K 13 . Consequently, after narrowing the levels to ^i, . . . , I 2 , each of the 
new extreme levels li, I 2 contains at least one of the following: 

— a root vertex (x); 

— a vertex of a path from x to Ci (ci included). 

In the latter case, if the vertex, say d, on level li or I 2 is not identical to vertices 
X or Ci, we can remove the part of the upward path Li from the extreme level 
of d to the vertex Ci. The tree maintains level non-planarity since the path Li of 
LNP pattern (a) starts from the vertex d now. After performing this operation 
on each path Li, we obtain a tree that matches either of our characterizations. 

3.2 Cycles 

We now study cycles that are bounded by the extreme levels of the pattern. A 
cycle must then contain at least two distinct paths between the extreme levels 
having vertices of the extreme levels only in their endpoints. These paths are 
called pillars. 



Level Non-planar Cycles 

Theorem 4. If a cycle has more than two distinct paths connecting the vertices 
on the extreme levels of a pattern, it is minimal level non-planar. 

Proof. The number of such paths must be even. So, following our assumption 
of more than two paths, the number of paths must be at least 4 in a level non- 
planar cycle. Without loss of generality, consider the 4-path case first. Let the 
extreme levels be i and j. Let us denote a sequence of paths along the cycle 
A = (Va,... ,Vb), B = {vb,... ,Vc), C = {vc,... ,Vd), D = {vd,... ,Va), and 
Va,Vc S Vi, Vb,Vd S Vj. Consider LNP pattern (c). The paths A, B, C can 
be mapped always to the paths L 2 , Li, L 3 of the pattern, respectively. The 
remaining path D can be then mapped to the bridge in LNP pattern (c) . If the 
number of paths is greater then 4, the first three paths can be mapped as in the 
case of 4 paths, and the remaining paths can be mapped to the bridge. 

Such a cycle is minimal since any edge that is removed from a level non-planar 
cycle results in a chain that can be drawn as a level planar graph. 



Level Planar Cycles Level planar cycles can be augmented by a set of chains 
to obtain minimal level non-planarity. First, we give some terminology related 
to level planar cycles. A vertex that lies on a pillar is called an outer vertex; 
all the remaining vertices are inner vertices. The endpoints of pillars are eorner 
vertices; if an extreme level i has only one vertex it is called a single eorner 
vertex. A bridge in the context of a planar cycle is the shortest walk between 
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corner vertices on the same level; a bridge contains two corner vertices as its 
endpoints and the remainder are inner vertices. A pillar is monotonic if, in a 
walk of the cycle, the level numbers of subsequent vertices of the pillar are 
monotonically increasing or decreasing, depending on the direction of traversal. 
We call two paths or chains parallel if they start on the same pillar and end on 
the same extreme level. If a chain is connected to a cycle by one of its vertices 
having degree 1 (considering only edges of the chain) then this vertex is called 
the starting vertex of the chain and the level where this vertex lies, the starting 
level. The other vertex of degree 1 of the chain is then the ending vertex and 
corresponding level, the ending level. 

Characterization Given a level planar cycle whose extreme levels are i and j, 
there are four cases to consider where augmentation of the level planar cycle by 
paths results in minimal level non-planarity. The pattern cannot contain one of 
the tree patterns given earlier. We enumerate these augmenting paths below. In 
all cases the paths start at a vertex on the cycle and end on an extreme level. 

Cl A single path pi starting from an inner vertex and ending on the opposite 
extreme level of the inner vertex; pi and the cycle share only one vertex. The 
path will have at least one vertex on an extreme level, the end vertex, and 
at most two, the start and end vertices. An example of this is illustrated in 
Fig. 0 (a); 

C 2 Two paths pi and p2, starting, respectively, from vertices and Vp^, Vp^ yf 
Up 2 , of the same pillar L = (u,, . . . , Up^ , . . . , Vp^ , . . .Vj) terminating on ex- 
treme levels j and i, respectively. Vertices Up^ or Vp.^ may be identical to 
corner vertices of L {vp^ = Vi or Vp^ = Vj) only if the corner vertices are 
not single corner vertices on their extreme levels. Paths pi and p2 do not 
have any vertices other than their start (if corner) and end vertices on the 
extreme levels. There are two subcases according to the levels of Vp^ and Vp^. 

- (j){vp,) < ^(up2); 

— 4 ’i'^pi) ^ this means, L must be a non-monotonic pillar. 

FiguresO (b) and (c) illustrate typical subgraphs matching the two subcases, 
respectively; 

C 3 Three paths, p\, p2 and p^. Path pi starts from a single corner vertex and 
ends on the opposite extreme level; paths p2 and ps start from opposite pillars 
and end on the extreme level where the single corner vertex is. Neither p2 
nor p3 can start from a single corner vertex. Figure 0 (d) illustrates a level 
planar cycle augmented by three paths causing level non-planarity; 

C 4 Four paths, pi, p2, P3 and p4. The cycle comprises a single corner vertex 
on each of the extreme levels. Paths pi and p2 start from different corner 
vertices and end on the opposite extreme level to their start with the paths 
embedded on either side of the cycle such that they do not intersect; paths 
P3 and p4 start from distinct non-corner vertices of the same pillar and finish 
on different extreme levels. The level numbers of starting vertices are such 
that they do not cause crossing of the last two paths. See Fig. 0 (e) for an 
illustration. 
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Fig. 3. Level Planar Cycles with Paths 



We will now prove that each of the path-augmented cycles is minimal level 
non-planar and, in Theorems El to Ml prove that this set is complete for hierar- 
chies. 

Theorem 5. Each of the four path-augmented cycles is minimal level non- 
planar. 

Proof. The augmented cycles are level non-planar because it can be shown that 
each can be mapped to one of Di Battista and Nardelli’s LNP patterns. To 
see minimality we consider the three cases of the starting position of the path- 
augmentation on the cycle. 

Suppose the start vertex is an inner vertex of a cycle. Since no subgraph 
matches an MLNP tree, breaking either an edge of the path or an edge of the 
cycle yields a level planar embedding. 

In case of a path-augmented cycle of type C2, the removal of any edge of the 
cycle allows one of the augmenting paths to be embedded through the “gap” left 
by that edge. The removal of any edge of an augmenting path allows that path 
to be embedded on the internal face of the cycle. In both cases no crossings will 
remain. 

For paths starting from corner vertices similar reasoning holds. 



Theorem 6. If a minimal level non-planar graph G eomprises a level planar 
cycle and a single path pi connected to the cycle, then p\ starts from an inner 
vertex of the cycle and ends on the opposite extreme level. 

Proof. For a path to cause level non-planarity, the path must start from an inner 
vertex of the cycle. Otherwise, the path can be embedded on the external face. 
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There are only two possibilities for causing level non-planarity: crossing with 
the incident bridge or, crossing with the opposite bridge. In the former, the path 
in combination with the lower part of the cycle forms a level non-planar tree. 
Since this level non-planar tree is minimal, the combination of the cycle and the 
path is not minimal. Therefore, the latter is the only remaining minimal level 
non-planar case. 



Theorem 7 . If a minimal level non-planar graph G comprises a level planar 
cycle and two paths p\ and p2 connected to the cycle, then p\ and p2 start from 
the same pillar and end on an extreme level and either they cross or they start 
from a non-monotonic sub-chain of the pillar. 

Proof. Neither of the two paths may start from an inner vertex of the cycle be- 
cause otherwise either they can be embedded on the internal face, or at least one 
of them matches type Cl above, or they form a level non-planar tree. Since the 
latter two cases are not minimal both paths must start from a pillar. Moreover, 
they must start from the same pillar, otherwise, the paths can be both embedded 
on the external face. The paths must finish on extreme levels, otherwise they 
can be embedded on the internal face. Moreover, the extreme levels must be 
different for if they are the same, the pattern - although it can be made level 
non-planar by introducing non-monotonic paths and a non-monotonic pillar - 
will not be minimal since it can be shown to match a minimal level non-planar 
tree pattern. If the extreme levels are different, then either the paths cross or 
there is a non-monotonic pillar that causes a crossing of the cycle and a path. 



Theorem 8 . If a minimal level non-planar graph G comprises a level planar 
cycle and three paths p\, p2 and pz connected to the cycle, then G has a single 
corner vertex c\ with pi starting at c\ and extending to the opposite extreme 
level and p2 and ps starting on opposite pillars and ending on the extreme level 
that contains ci. 

Proof. As in the case of two paths, none of the paths may start from an inner 
vertex. Hence, all the paths should start from pillars. Additionally, all the paths 
must end on extreme levels, otherwise, they can be embedded on the internal 
face. No pair of paths can create minimal level non-planarity of the type C2 
above. These conditions are met if one of the paths starts from a single corner 
vertex. If there were no other paths, the path starting from the single corner could 
be embedded on the external face on both sides of the cycle. However, if we have 
two paths starting from different pillars, not from a single corner vertex, and 
ending on the extreme level of the single corner vertex, a level planar embedding 
is not possible. 



Theorem 9 . If a minimal level non-planar graph G comprises a level planar 
cycle and four paths pi, ■ . . ,pa connected to the cycle, then G has two single 
corner vertices c\ and C2 with pi starting at ci and extending to the opposite 
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extreme level of ci, p 2 starting at C 2 and extending to the opposite extreme level 
of C 2 , and ps and p^ starting on the same pillar and diverging to end on opposite 
extreme levels such that they do not cross. 

Proof. This is proved analogously to the previous theorem, considering two cor- 
ner vertices instead of one. 



Theorem 10. If a level non-planar graph G comprises a level planar cycle and 
five or more path augmentations that extend to extreme levels, then G cannot be 
minimal level non-planar. 

Proof. A level non-planar pattern with two parallel paths cannot be minimal 
(since any path that crosses one will cross both or one of the parallel paths is 
redundant) unless one of the parallel paths can be embedded on the other side 
of the cycle, in which case this path starts from a single corner vertex. 

Suppose we have a minimal level non-planar graph comprising a level planar 
cycle and five path augmentations extending to extreme levels. Since removing 
any edge from a minimal level non-planar graph makes it planar, there must 
be four non-crossing paths. This can be achieved only by having on each pillar 
either two diverging paths ending on opposite extreme levels or, parallel paths 
where one pair of paths starts out from a single corner vertex. In neither case is 
it possible to add a fifth path so that minimal level non-planarity holds. 

3.3 Minimal Level Non-planar Subgraphs in Hierarchies 

Having shown that our characterizations of trees, level non-planar cycles and 
path-augmented cycles are minimally level non-planar, it only remains for us 
now to show that this set is a complete characterization of minimal level non- 
planar subgraphs. 

Theorem 11. The set of MLNP patterns characterized in seetion.s ll. H l,V. ‘A and 
I.V. ‘A is complete for hierarchies. 

Proof. Every graph comprises either a tree, or one, or more, cycles. It remains to 
prove that there is no MLNP pattern containing more than one cycle. Suppose 
a graph is MLNP and it has more than one cycle. Then it must be a subcase of 
one of Di Battista and Nardelli’s LNP patterns. Each of these, however, has at 
most one single cycle and the remainder of the patterns comprises chains. Then 
at least one of our cycles must be broken in order to match it to a chain, thus 
contradicting the hypothesis. 

4 Minimal Level Non-planar Snbgraphs in Level Graphs 

We have given in the previous section a characterization of level planar hier- 
archies in terms of minimal forbidden subgraphs. It remains to show that the 
described patterns characterize level planarity for general level graphs as well. 
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Theorem 12. Let G = {V, E, (f>) be a level graph with k > 1 levels. Then G is 
not level planar if, and only if it eontains one of the MLNP patterns as described 
in sections [13 EH anrfEIl 

Proof. If a subgraph Gp of G corresponds to an MLNP pattern, then G must be 
non level planar. 

It remains to proof the opposite direction. Suppose there exists a minimal 
pattern P of non level planarity that is not applicable for hierarchies. Let G be 
a level graph such that P is the only pattern of non level planarity in G. 

Since G is not level planar, augmenting the graph by an incoming edge for 
every source preserving the leveling in the graph constructs a non level planar 
hierarchy H = {V,E LI Eh,(I>), where Eh is the set of all extra added edges. 
Let V be the set of all subgraphs of P[ corresponding to a MLNP pattern. By 
assumption, we have for any Gp GV, Gp = (Vp, Ep, (f) that there exists an edge 
Cp G EpD Eh. Removing for every Gp G V the edge Cp from P[, we construct a 
level planar graph H' . By construction, H' contains G as a subgraph. Since every 
subgraph of a level planar graph must be level planar itself, this contradicts G 
being a non level planar subgraph. 

5 Conclusion 

We have given a characterization of level planar graphs in terms of minimal 
forbidden subgraphs. This description of level planarity is a main contribution for 
solving the MV-h.aid level planarization problem for practical instances. Based 
on the characterization of level planar graphs an integer linear programming 
formulation for the level planarization problem can be given, allowing to study 
the associated polytope in order to develop an efficient branch-and-cut approach. 
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Abstract. A rectangular drawing of a plane graph G is a drawing of 
G such that each vertex is drawn as a point, each edge is drawn as a 
horizontal or a vertical line segment, and the contour of each face is drawn 
as a rectangle. A necessary and sufficient condition for the existence of 
a rectangular drawing has been known only for the case where exactly 
four vertices of degree 2 are designated as corners in a given plane graph 
G. In this paper we establish a necessary and sufficient condition for the 
existence of a rectangular drawing of G for the general case in which no 
vertices are designated as corners. We also give a linear-time algorithm 
to find a rectangular drawing of G if it exists. 



Key words: Graph, Algorithm, Graph Drawing, Rectangular Drawing. 

1 Introduction 

Recently automatic drawings of graphs have created intense interest due to their 
broad applications, and as a consequence, a number of drawing styles and corre- 
sponding drawing algorithms have come out [DETT99] . Among different drawing 
styles a “rectangular drawing” has attracted much attention due to its applica- 
tions in VLSI floorplanning and architectural floorplanning [BS88, H93, H95, 
KH97, KK84, L90, RNN98]. A rectangular drawing of a plane graph G is a 
drawing of G in which each vertex is drawn as a point, each edge is drawn as 
a horizontal or a vertical line segment, and the contour of each face is drawn 
as a rectangle. Note that in a rectangular drawing of a plane graph G the con- 
tour of the outer face of G is also drawn as a rectangle. Fig. Hd) illustrates a 
rectangular drawing of the graph in Fig. me). Glearly not every plane graph has 
a rectangular drawing. For example the graph in Fig. me) has no rectangular 
drawing. 



D.-Z. Du et al. (Eds.): COCOON 2000, LNCS 1858, pp. 8.5-Fni 2000. 
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(e) 



Fig. 1. (a) Interconnection graph, (b) dual-like graph, (c) good designation of 
corners, (d) rectangular drawing, (e) bad designation of corners, and (f) 2-3 
plane graph. 



In a VLSI floorplanning problem, the interconnection among modules is usu- 
ally represented by a plane graph, every inner face of which is triangulated. 
Fig. Eta) illustrates such an interconnection graph of 17 modules. The dual-like 
graph of an interconnection graph is a cubic graph in which every vertex has 
degree 3 [L90]. Fig. ^b) illustrates a dual-like graph of the graph in Fig. EKa), 
which has 17 inner faces. Inserting four vertices a, b, c and d of degree 2 in ap- 
propriate edges on the outer face contour of the dual-like graph as illustrated 
in Fig. me), one wishes to find a rectangular drawing of the resulting graph as 
illustrated in Fig. md). If there is a rectangular drawing, it yields a floorplan of 
modules. Each vertex of degree 2 is a corner of the rectangle corresponding to 
the outer rectangle. Thomassen [T84] gave a necessary and sufficient condition 
for such a plane graph with exactly four vertices of degree 2 to have a rectangu- 
lar drawing. Linear-time algorithms are given in [BS88, H93, KH97, RNN98] to 
obtain a rectangular drawing of such a plane graph, if it exists. However, it has 
not been known how to appropriately insert four vertices of degree 2 as corners. 
If four vertices a, 6, c and d of degree 2 are inserted in edges as in Fig. me), then 
the resulting graph has no rectangular drawing. 
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In this paper we assume that four or more vertices of degree 2 have been 
inserted to all edges that may be drawn as edges incident to corners in a rectan- 
gular drawing; for example, insert one or two vertices of degree 2 to each edge 
on the outer contour. Thus we consider a plane connected graph G in which 
every vertex on the outer contour has degree 2 or 3, every vertex not on the 
outer contour has degree 3, and there are four or more vertices of degree 2 on 
the outer contour. We call such a graph a 2-3 plane graph. (See Fig. ^f).) We do 
not assume that four vertices of degree 2 are designated as corners in advance. 
We give a necessary and sufficient condition for such a 2-3 plane graph G to have 
a rectangular drawing, and give a linear-time algorithm to find appropriate four 
vertices of degree 2 and obtain a rectangular drawing of G with these vertices 
as corners. 

The rest of the paper is organized as follows. Section 2 describes some defi- 
nitions and presents preliminary results. Section 3 presents our main result on 
rectangular drawing. Finally, Section 4 concludes with an open problem. 

2 Preliminaries 

In this section we give some definitions and present preliminary results. 

Let G be a connected simple graph. We denote the set of vertices of G by 
V{G), and the set of edges of G by E(G). The degree of a vertex v is the number 
of neighbors of u in G. We denote the maximum degree of a graph G by Z\. 

A graph is planar if it can be embedded in the plane so that no two edges 
intersect geometrically except at a vertex to which they are both incident. A 
plane graph is a planar graph with a fixed embedding. A plane graph divides the 
plane into connected regions called faces. We regard the contour of a face as a 
clockwise cycle formed by the edges on the boundary of the face. We denote the 
contour of the outer face of graph G by Go(G). We call a vertex not on Go(G) 
an inner vertex of G. 

For a simple cycle G in a plane graph G, we denote by G(G) the plane 
subgraph of G inside G (including G) . An edge of G which is incident to exactly 
one vertex of a simple cycle G and located outside G is called a leg of the cycle 
G. The vertex of G to which a leg is incident is called a leg-vertex of G. A simple 
cycle G in G is called a k-legged cycle of G if G has exactly k legs in G. A fc-legged 
cycle G is a minimal k-legged cycle if G(G) does not contain any other fc-legged 
cycle of G. We say that cycles G and C' in a plane graph G are independent if 
G(G) and G(G') have no common vertex. A set S of cycles is independent if any 
pair of cycles in S are independent. 

A rectangular drawing of a plane graph G is a drawing of G such that each 
edge is drawn as a horizontal or a vertical line segment, and each face is drawn 
as a rectangle. In a rectangular drawing of G, the contour Go(G) of the outer 
face is drawn as a rectangle and hence has four convex corners. Such a corner is 
called a corner of the rectangular drawing. Since a rectangular drawing D oi G 
has no bend and each edge is drawn as a horizontal or a vertical line segment, 
only a vertex of degree 2 of G can be drawn as a corner of D. Therefore, a graph 
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with less than four vertices of degree 2 on CoiG) has no rectangular drawing. 
Thus we consider a 2-3 plane graph G in which four or more vertices on Co{G) 
have degree 2 and all other vertices have degree 3. We call a vertex of degree 2 
in G that is drawn as a corner of D a corner vertex. 

The following result on rectangular drawings is known [T84, RNN98]. 

Lemma 1 . Let G be a 2-3 plane graph. Assume that four vertices of degree 2 
are designated as corners. Then G has a rectangular drawing if and only if G 
satisfies the following three conditions [T84-]: (cl) G has no Tlegged cycles; (c2) 
every 2-legged cycle in G contains at least two designated vertices; and (c3) every 
3-legged cycle in G contains at least one designated vertex. Furthermore one can 
check in linear time whether G satisfies the conditions above, and if G does then 
one can find a rectangular drawing of G in linear time [RNN98]. □ 



It is rather difficult to determine whether a 2-3 plane graph G has a rectangu- 
lar drawing unless four vertices of degree 2 are designated as corners. Considering 
all combinations of four vertices of degree 2 as corners and applying the algorithm 
in Lemma n for each of the combinations, one can determine whether G has a 
rectangular drawing. Such a straightforward method requires time 0{n^) since 
there are 0(n"^) combinations, and one can determine in linear time whether G 
has a rectangular drawing for each combination. In the next section we obtain 
a necessary and sufficient condition for G to have a rectangular drawing for the 
general case in which no vertices of degree 2 are designated as corners. The 
condition leads to a linear-time algorithm. 



3 Rectangular Drawings Without Designated Corners 

The following theorem is our main result on rectangular drawings. 

Theorem 2 . A 2-3 plane graph G has a rectangular drawing if and only if G 
satisfies the following four conditions: 

(1) G has no l-legged cycle; 

(2) every 2-legged cycle in G contains at least two vertices of degree 2; 

(3) every 3-legged cycle in G contains at least one vertex of degree 2; and 

(4) if an independent set S of cycles in G consists of C 2 2-legged cycles and C 3 

3-legged cycles, then 2 c 2 + C3 < 4. □ 

Before proving Theorem El we observe the following fact. 

Fact 3 In any rectangular drawing D of G, every 2-legged cycle of G contains 
at least two corners, every 3-legged cycle of G contains at least one corner, and 
every cycle with four or more legs may contain no corner. □ 
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We now prove the necessity of Theorem El 
Necessity of Theorem El Assume that G has a rectangular drawing D. Then 
G has no 1-legged cycle, since the face surrounding a 1-legged cycle cannot be 
drawn as a rectangle in D. By Fact El every 2-legged cycle in D contains at least 
two corners and every 3-legged cycle in D contains at least one corner. Since a 
corner is a vertex of degree 2, every 2-legged cycle in G must have at least two 
vertices of degree 2 and every 3-legged cycle must have at least one vertex of 
degree 2. 

Let an independent set S consist of C2 2-legged cycles and C3 3-legged cycles 
in G. Then by Fact El each of the C2 2-legged cycles in S contains at least two 
corners, and each of the C3 3-legged cycle in S contains at least one corner. Since 
all cycles in S are independent, they are vertex-disjoint each other. Therefore 
there are at least 2c2 -I- C3 corners in D. Since there are exactly four corners in 
D, we have 2c2 -h C3 < 4. □ 

In the rest of this section we give a constructive proof for the sufficiency of 
Theorem El and show that the proof leads to a linear-time algorithm for finding 
a rectangular drawing of G if it exists. 

We first give some definitions. For a graph G and a set V C V{G), G — V 
denotes a graph obtained from G by deleting all vertices in V together with 
all edges incident to them. For a plane graph G, we define a Go{G)~ component 
as follows. A subgraph F" of G is a Go (G)-component if F consists of a single 
edge which is not in Go(G) and both ends of which are in Go(G). The graph 
G — V {Go{G)) may have a connected component. Add to such a component all 
edges of G, each joining a vertex in the component and a vertex in Go(G). The 
resulting subgraph F of G is a Go (G)-component, too. All these subgraphs F of 
G and only these are the Go(G)-components. 

Let {ui, U2, • • • , Vp-i,Vp}, p > 3, be a set of three or more consecutive vertices 
on Go{G) such that the degrees of the first vertex vi and the last vertex Vp are 
exactly three and the degrees of all intermediate vertices V 2 , U3, • • • , Vp-i are two. 
Then we call the set {u2, U3, • • • , Vp-i} a chain of G, and we call vertices v\ and 
Vp the ends of the chain. Every vertex of degree 2 in G is contained in exactly 
one chain. A chain in G corresponds to an edge of the dual- like graph of an 
interconnection graph into which vertices of degree 2 have been inserted. Two 
chains are consecutive if they have a common end. We now have the following 
lemmas. 

Lemma 4. Let G be a 2-3 plane graph, and let k be the number of vertices of 
degree 3 on Co{G). If a Go{G)~ component F contains a cycle, then F contains 
a cycle with k or less legs. □ 



Lemma 5. Assume that a 2-3 plane graph G satisfies the four conditions in 
Theorem^ and that G has at most three vertices of degree 3 on Go{G). Then G 
has a rectangular drawing. 
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Proof. If G is a cycle, then clearly G has a rectangular drawing. Therefore we 
may assume that G has at least one vertex of degree 3 on Go{G). If G had exactly 
one vertex of degree 3 on Go(G), then G would have a 1-legged cycle or a vertex 
of degree 1, contrary to the assumption that G is a 2-3 plane graph satisfying 
the conditions in Theorem |2J Thus G has either two or three vertices of degree 
3 on Go(G), and hence there are the following two cases to consider. 

Case 1: G has exactly two vertices of degree 3 on Go(G). 

In this case G has exactly one Go (G)-component F. We now claim that F is 
a graph of a single edge. Otherwise, F has a cycle, and then by Lemma 0F has 
a 2-legged cycle, but the 2-legged cycle contains no vertex of degree 2, contrary 
to the assumption that G satisfies Condition (2). 

Clearly the ends of the edge in F are the two vertices of degree 3 on Go(G). 
Since G is a simple graph, the two vertices divide Go(G) into two chains of G. 
Furthermore, G has two 2-legged cycles Gi and G2 as indicated by dotted lines 
in Fig. IK a). Since G satisfies Condition (2), each of Gi and C 2 contains at least 
two vertices of degree 2. We arbitrarily choose two vertices of degree 2 on each 
of Cl and C 2 as corner vertices. Then we can find a rectangular drawing of G 
as illustrated in Fig. m- 




Fig. 2. Illustration for Case 1. 



Case 2: G has exactly three vertices of degree 3 on Go(G). 

In this case G has exactly one Go (G)-component F. The Go (G)-component F 
has no cycle; otherwise, by Lemma El G would have a 2-legged or 3-legged cycle 
containing no vertex of degree 2, contrary to the assumption that G satisfies 
Conditions (2) and (3). Therefore, F is a tree, and has exactly one vertex of 
degree 3 not on Go(G). Thus G is a “subdivision” of a complete graph K 4 of four 
vertices, and has three 3-legged cycles Gi, C 2 and G3 as indicated by dotted lines 
in Fig. Ha). Since G satisfies Condition (3), each 3-legged cycle contains at least 
one vertex of degree 2. We choose four vertices of degree 2; one vertex of degree 
2 on each of the three 3-legged cycles, and one vertex of degree 2 arbitrarily. We 
then obtain a rectangular drawing F of G as illustrated in Fig. EJb). □ 

By Lemma El we may assume that G has four or more vertices of degree 3 on 
Go(G). We now have the following lemmas. 
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Lemma 6. Assume that a 2-3 plane graph G satisfies the four conditions in 
Theorem\^ that G has four or more vertices of degree 3 on Go{G), and that 
there is exactly one Co{G)- component. Then the following (a)-(d) hold: 

(a) for any 2-legged cycle G, at most one chain of G is not on C; 

(b) one can choose four vertices of degree 2 in a way that at most two of them 
are chosen from any chain and at most three are chosen from any pair of 
consecutive chains; 

(c) G has a 3-legged cycle; and 

(d) if G has two or more independent 3-legged cycles, then the set of all minimal 

3-legged cycles in G is independent. □ 



Lemma 7. If a 2-3 plane graph G has two or more Go{G)- components, then G 
has a pair of independent 2-legged cycle. □ 



We are now ready to prove the following lemma. 

Lemma 8. Assume that a 2-3 plane graph G satisfies the four conditions in 
Theorem and that Go{G) has four or more vertices of degree 3. Then G has 
a rectangular drawing. 

Proof. We shall consider the following two cases depending on the number of 
Co (G)-components. 

Case 1: G has exactly one Go (G)-component. 

By Lemma ETC) G has a 3-legged cycle. We shall consider the following two 
subcases depending on whether G has a pair of independent 3-legged cycles or 
not. 

Subcase la: G has no pair of independent 3-legged cycle. 

By Lemma EJb) we can designate four vertices of degree 2 as corners in a way 
that at most two of them are chosen from any chain and at most three are chosen 
from any pair of consecutive chains of G. By Lemma [D it suffices to show that 
G satisfies the three conditions (cl)-(c3) regarding the four designated vertices. 
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Since G satisfies Condition (1) in Theorem El G has no 1-legged cycle and 
hence satisfies (cl). 

We now claim that G satisfies (c2), that is, any 2-legged cycle C in G contains 
at least two designated vertices. By Lemma El) a) at most one chain of G is not 
on G. Since any chain contains at most two of the four designated vertices, G 
must contain at least two of them. 

We then claim that G satisfies (c3), that is, every 3-legged cycle G in G 
has at least one designated vertex. By Condition (3) G has a vertex of degree 
2. Therefore exactly two of the three legs of G lie on Go(G). Let x and y be 
the leg- vertices of these two legs, respectively. Let P be the path on Go{G) 
starting at x and ending at y without passing through any edge on G. Then 
P contains at most two chains; otherwise, either G would have more than one 
Go (G)-components or G would have a pair of independent 3-legged cycles, a 
contradiction. Furthermore, if P contains exactly one chain then it contains at 
most two designated vertices, and if P contains exactly two chains then they are 
consecutive and contain at most three designated vertices. Hence, in either case, 
G contains at least one of the four designated vertices. 

Subcase lb: G has a pair of independent 3-legged cycles. 

Let A4 be the set of all minimal 3-legged cycles in G. By Lemma El)d) Ai is 
independent. Let k = \M.\, then clearly 2 < k and A: < 4 by Condition (4). For 
each 3-legged cycle Cm in Al, we arbitrarily choose a vertex of degree 2 on Cm- 
If fc < 4, we arbitrarily choose A — k vertices of degree 2 on Go(G) which are not 
chosen so far in a way that at most two vertices are chosen from any chain. This 
can be done because G has four or more vertices of degree 2 and every 2-legged 
cycle has two or more vertices of degree 2. Thus we have chosen exactly four 
vertices of degree 2, and we regard them as the four designated corner vertices. 
In Fig. Ha) four vertices a,b,c and d of degree 2 are chosen as the designated 
corner vertices; vertices o, c and d are chosen on three independent minimal 3- 
legged cycles indicated by dotted lines, whereas vertex b is chosen arbitrarily 
on Go(G). By Lemma[I]it suffices to show that G satisfies the conditions (cl)- 
(c3) regarding the four designated vertices. We can indeed prove that G satisfies 
Conditions (cl)-(c3) regarding the four designated vertices. The detail is omitted 
in this extended abstract. 

Case 2: G has two or more Go (G)-components. 

In this case by Lemma 0 G has a pair of independent 2-legged cycles, Gi 
and G 2 . One may assume that both Gi and G 2 are minimal 2-legged cycles. By 
Condition (4) at most two 2-legged cycles of G are independent. Therefore, for 
any other 2-legged cycle G'(yf Gi,G 2 ), G(G') contains either C\ or G 2 . 

Let ki, i = 1 and 2, be the number of all minimal (not always independent) 
3-legged cycles in G(G^). Then we claim that fci < 2, i = 1 and 2. First consider 
the case where Gi has exactly three vertices of degree 3 on Go(G). Then G{Ci) 
has exactly two inner faces; otherwise, G(G^) would have a cycle which has 2 
or 3 legs and has no vertex of degree 2, contrary to Condition (2) or (3). The 
contours of the two faces are minimal 3-legged cycles, and there is no other 
minimal 3-legged cycle in G{Ci). Thus ki = 2. We next consider the case where 
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(a) (b) 

Fig. 4. (a)Illustration for Case 1, and (b) illustration for Case 2. 

Ci has four or more vertices of degree 3 on C'o(G). Then we can show, similarly 
as in the proof of Lemma EKd), that the set of all minimal 3-legged cycles of 
G in G{Ci) is independent. If ki > 3, then Condition (4) would not hold for 
the independent set 5 of /ci -I- 1 cycles: the ki 3-legged cycles in G{Ci) and the 

2- legged cycle Gj, j = 1 or 2 and j ^ i. Thus ki <2. 

We choose two vertices of degree 2 on each Gi, 1 < t < 2, as follows. For 
each of the ki minimal 3-legged cycle in G(Gi), we arbitrarily choose exactly one 
vertex of degree 2. li ki < 2, then we arbitrarily choose 2 — ki vertices of degree 
2 in V{Gi) which have not been chosen so far in a way that at most two vertices 
are chosen from any chain. This can be done because by Condition (2) Gi has at 
least two vertices of degree 2. Thus we have chosen four vertices of degree 2, and 
we regard them as the designated corner vertices for a rectangular drawing of 
G. In Fig. m G has a pair of independent 2-legged cycles C\ and G 2 , k\ =2, 
k 2 = 1, and four vertices a,b,c and d on Go(G) are chosen as the designated 
vertices. Vertices a and d are chosen from the vertices on Gi; each on a minimal 

3- legged cycle in G(Gi). Vertices b and c are chosen from the vertices on G 2 ; c 
is on the minimal 3-legged cycle in G(G 2 ), and b is an arbitrary vertex on G 2 
other than c. 

By Lemma Q it suffices to show that G satisfies the conditions (cl)-(c3) 
regarding the four designated vertices. We can indeed prove that G satisfies 
Conditions (cl)-(c3) regarding the four designated vertices. The detail is omitted 
in this extended abstract. □ 



Lemmas 0 and 0 immediately imply that G has a rectangular drawing if G 
satisfies the conditions in Theorem |2| Thus we have constructively proved the 
sufficiency of Theorem |3 

We immediately have the following corollary from Theorem 0 

Corollary 9. A 2-3 plane graph G has a rectangular drawing if and only if G 
satisfies the following six conditions: 

(1) G has no 1-legged cycle; 

(2) every 2-legged cycle in G has at least two vertices of degree 2; 
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(3) every 3-legged cycle in G has a vertex of degree 2; 

(4~1) at most two 2-legged cycles in G are independent of each other; 

(4--2) at most four 3-legged cycles in G are independent of each other; and 
(4-3) if G has a pair of independent 2-legged cycles Ci and G 2 , then each of 
G{Gi) and G{G 2 ) has at most two independent 3-legged cycles of G and 
the set {Ci,C' 2 ,C 3 } of cycles is not independent for any 3-legged cycle C 3 
in G. □ 

We now have the following theorem. 

Theorem 10. Given a 2-3 plane graph G, one can determine in linear time 
whether G has a rectangular drawing or not, and if G has, one can find a rect- 
angular drawing of G in linear time. □ 



4 Conclusions 

In this paper we established a necessary and sufficient condition for a 2-3 plane 
graph G to have a rectangular drawing. We gave a linear-time algorithm to 
determine whether G has a rectangular drawing, and find a rectangular drawing 
of G if it exists. Thus, given a plane cubic graph G, we can determine in linear 
time whether one can insert four vertices of degree 2 into some of the edges on 
Go{G) so that the resulting plane graph G' has a rectangular drawing or not, and 
if G' has, we can find a rectangular drawing of G' in linear time. It is left as an 
open problem to obtain an efficient algorithm for finding rectangular drawings 
of plane graphs with the maximum degree Z\ = 4. 
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Abstract. We study the problem of optimizing over the set of all com- 
binatorial embeddings of a given planar graph. At IPCO’ 99 we pre- 
sented a first characterization of the set of all possible embeddings of 
a given biconnected planar graph G by a system of linear inequalities. 
This system of linear inequalities can be constructed recursively using 
SPQR-trees and a new splitting operation. In general, this approach may 
not be practical in the presence of high degree vertices. 

In this paper, we present an improvement of the characterization which 
allows us to deal efficiently with high degree vertices using a separation 
procedure. The new characterization exposes the connection with the 
asymmetric traveling salesman problem thus giving an easy proof that 
it is NP-hard to optimize arbitrary objective functions over the set of 
combinatorial embeddings. 

Computational experiments on a set of over 11000 benchmark graphs 
show that we are able to solve the problem for graphs with 100 vertices 
in less than one second and that the necessary data structures for the 
optimization can be build in less than 12 seconds. 



1 Introduction 

A graph is called planar if it admits a drawing into the plane without edge- 
crossings {planar drawing). We call two planar drawings of the same graph 
equivalent when the circular sequence of the edges around each vertex is the 
same in both drawings. The equivalence classes of planar drawings are called 
eombinatorial embeddings. A combinatorial embedding also defines the set of 
cycles in the graph that bound faces in a planar drawing. 

The complexity of embedding planar graphs has been studied by various 
authors in the literature Q In this paper we deal with the following opti- 
mization problem concerning embeddings: Given a planar biconnected graph and 
a cost function on the cycles of the graph, find an embedding II such that the 
sum of the cost of the cycles that appear as face cycles in II is minimized. The 
objective function is chosen only to demonstrate the feasibility of the approach in 
computational experiments. However, our description of the set of combinatorial 
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embeddings of a planar graph as an Integer Linear Program (ILP) makes some 
important NP-hard problems arising in graph drawing accessible to ILP-based 
optimization methods. 

One example is the minimization of the number of bends in an orthogonal 
planar drawing of a graph. This number highly depends on the chosen planar em- 
bedding. Whereas bend minimization of a planar graph is NP-hard ([ 01 , a branch 
and bound algorithm for the problem is given in jO] ) , it can be solved in polyno- 
mial time for a fixed embedding (PI)- Figured shows two different orthogonal 
drawings of the same graph that were generated using the bend minimization 
algorithm by Tamassia m)- The algorithm used different combinatorial em- 
beddings as input. Drawing |l(a)| has 13 bends while drawing |l(b)| has only 7 
bends. 





Fig. 1. The impact of the chosen planar embedding on the drawing 



For a fixed embedding, the bend minimization problem can be formulated 
as a flow problem in the geometric dual graph. This is the point where we plan 
to use our description: Once we have characterized the set of all embeddings 
via an integer linear formulation on the variables associated with cycles of the 
graph, we can combine this formulation with the flow problem to solve the bend 
minimization problem over all embeddings. 

In 1 1 we introduced an integer linear program (ILP) whose set of feasible 
solutions corresponds to the set of all possible combinatorial embeddings of a 
given biconnected planar graph. The program is constructed recursively with the 
advantage that we only introduce variables for those simple cycles in the graph 
that form the boundary of a face in at least one combinatorial embedding of the 
graph, thus reducing the number of variables tremendously. The constraints are 
derived using the structure of the graph. We use a data structure called SPQR- 
tree suggested by Di Battista and Tamassia (| 2 |) for the on-line maintenance of 
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triconnected components. SPQR-trees can be used to code and enumerate all 
possible combinatorial embeddings of a biconnected planar graph. 

The problem of our original formulation was that it may be inefficient for 
graphs with high-degree vertices, because it requires to compute the convex 
hull of a number of points that may be exponential in the degree of a vertex. 
When looking for a solution to this problem, we discovered a connection of the 
embedding problem with the asymmetric traveling salesman problem (ATSP) 
enabling us to apply the same machinery used for solving the ATSP-problem to 
the problem of finding an optimal embedding for a planar biconnected graph. 

Our computational results on a set of more than 11000 benchmark graphs 
show that our new approach preserves the positive properties of our first ap- 
proach while making it possible to deal efficiently with high degree vertices. As 
in our first version, the size of the integer linear system computed for the bench- 
mark graphs grows only linearly with the size of the problem and the computed 
systems can be solved fast using a mixed integer program solver. There are only 
a few graphs with high degree vertices for which we need to apply our separation 
procedure and we never need more than three separation steps. 

Section O gives a brief overview of SPQR-trees. In Section 01 we sketch the 
recursive construction of the linear constraint system using a splitting operation 
on the SPQR-tree. Section 0 introduces the connection of our problem with 
the ATSP problem and describes the separation procedure. Our computational 
results are described in Section 01 

2 SPQR- Trees 

In this section, we give a brief overview of the SPQR-tree data structure for bi- 
connected graphs. SPQR-trees have been suggested by Di Battista and Tamassia 
([2]). They represent a decomposition of a biconnected graph into triconnected 
components. A connected graph is triconnected, if there is no pair of vertices in 
the graph whose removal splits the graph into two or more components. 

An SPQR-tree has four types of nodes and with each node is associated a 
biconnected graph which is called the skeleton of that node. This graph can 
be seen as a simplified version of the original graph and its vertices are also 
contained in the original graph. The edges in a skeleton represent subgraphs of 
the original graph. The node types and their skeletons are as follows: 

1. Q-node: The skeleton consists of two vertices that are connected by two 
edges. One of the edges represents an edge e of the original graph and the 
other one the rest of the graph. 

2. S'-node: The skeleton is a simple cycle with at least 3 vertices. 

3. P-node: The skeleton consists of two vertices connected by at least three 
edge. 

4. P-node: The skeleton is a triconnected graph with at least four vertices. 

All leaves of the SPQR-tree are Q-nodes and all inner nodes S-,P or P- 
nodes. When we see the SPQR-tree as an unrooted tree, then it is unique for 
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each biconnected planar graph. Another important property of these trees is 
that their size (including the skeletons) is linear in the size of the original graph 
and that they can be constructed in linear time 

As described in 0, SPQR-trees can be used to represent all combinatorial 
embeddings of a biconnected planar graph. This is done by choosing embeddings 
for the skeletons of the nodes in the tree. The skeletons of S- and Q-nodes are 
simple cycles, so they have only one embedding. Therefore, we only have to look 
at the skeletons of R- and P-nodes. The skeletons of i?-nodes are triconnected 
graphs. Our definition of combinatorial embeddings distinguishes between two 
combinatorial embeddings of a triconnected graph, which are mirror-images of 
each other (the circular order of the edges around each vertex in clockwise order 
is reversed in the second embedding). The number of different embeddings of a 
P-node skeleton is {k — 1)! where k is the number of edges in the skeleton. 

Every combinatorial embedding of the original graph defines a unique com- 
binatorial embedding for each skeleton of a node in the SPQR-tree. Conversely, 
when we define an embedding for each skeleton of a node in the SPQR-tree, we 
define a unique embedding for the original graph. Thus, if the SPQR-tree of G 
has r P-nodes and the P-nodes Pi to Pk where the skeleton of Pi has Li edges, 
then the number of combinatorial embeddings of G is exactly 

k 

2'-n(i,-i)! . 

i=l 

Because the embeddings of the P- and P-nodes determine the embedding of the 
graph, we call these nodes the decision nodes of the SPQR-tree. 

3 Recursive Construction of the Integer Linear Program 

3.1 The Variables of the Integer Linear Program 

A face cycle in a combinatorial embedding of a planar graph is a directed cycle 
of the graph with the following property: In any planar drawing realizing the 
embedding, the left side of the cycle is empty. Note that the number of face 
cycles of a planar biconnected graph with m edges and n vertices is m — n + 2. 

We construct an integer linear program (ILP) where the feasible solutions 
correspond to the combinatorial embeddings of a graph. The variables of the 
program are the same as in ra. They are binary and represent directed cycles 
in the graph. In a feasible solution of the ILP, a variable Xc has value I if the 
associated cycle c is a face cycle in the represented embedding and 0 otherwise. 
To keep the number of variables as small as possible, we only introduce variables 
for those cycles that are indeed face cycles in a combinatorial embedding of the 
graph. 

3.2 Splitting an SPQR-Tree 

We construct the variables and constraints of the ILP recursively. Therefore, 
we need an operation that constructs a number of smaller problems from our 
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original problem such that we can use the variables and constraints computed 
for the smaller problems to compute the ILP for the original problem. This is 
done by splitting the SPQR-tree at some decision-node v. 

The splitting operation deletes all edges in the SPQR-tree incident to v whose 
other endpoint is not a Q-node, thus producing smaller trees. Then we attach 
new Q-nodes to all nodes that were incident to deleted edges to make sure that 
the trees we produce are again SPQR-trees. The new edges we use to attach the 
Q-nodes are called split-edges and the trees we produce split-trees. The graphs 
represented by the split-trees are called split-graphs. The new tree containing 
V is the center split-tree and the associated graph the center split-graph. The 
split-trees either have only one decision node (like the center split-tree) or at 
least one less than the original tree. The splitting process is depicted in Fig. 0 




Fig. 2. Splitting an SPQR-tree at an inner node 



3.3 Construction of the ILP for SPQR- Trees 

Since this topic is treated in detail in H2| and HH , we only give a short sketch of 
our approach. Let T be the SPQR-tree of a biconnected planar graph G, v the 
node used for splitting the tree and Ti,. . . ,Tk the split-trees of v. We assume 
that Ti is the center split-tree and that the graph Gi is the split-graph belonging 
to split-tree Ti. We can distinguish two types of directed cycles in G: 

1. Local cycles are contained in one of the graphs Gi, . . . , Gfc. 

2. Global cycles of are not contained in any of the Gi. 

We assume that we have already computed the ILP li for each Ti . The variables 
in li that represent local cycles will also be variables in the ILP for T. We 
compute the global cycles of G for which we need variables by combining cycles 
in the split-graphs that are represented by variables in the L. 

The set G of all constraints of the ILP of T is given by G = G/ U Gc U Gg. G/ 
is the set of lifted constraints. For each constraint contained in L, we compute 
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a constraint that is valid for T by replacing each variable Xc by the sum of the 
variables in R{xc) (The set of variables for T whose associated cycles have been 
constructed using c). 

The set Cc is the set of choice constraints. They state that for each variable 
Xc computed for Ti, the sum of the variables in the set R{xc) can be at most 
one. This is true because all of the cycles in R{xc) either pass an edge or one 
of the split-graphs in the same direction and therefore at most one of the cycles 
can be a face cycle in any embedding. The proof is omitted but can be found in 

El. 

The only element of Cg is the center graph constraint, which states that the 
number of global face-cycles in any feasible solution plus the number of local 
face-cycles contained in Gi is equal to the number of faces of Gi. This is true, 
because any drawing of G can be generated from a drawing of Gi by replacing 
some edges by subgraphs. In this process, the face cycles of Gi may be replaced 
by global cycles or are preserved (if they are local cycles of G) . 

We observe that a graph G whose SPQR-tree has only one inner node is 
isomorphic to the skeleton of this node. Therefore, the ILP for an SPQR-tree 
with only one inner node is defined as follows: 

— 5'-node: When the only inner node of the SPQR-tree is an S-node, G is a 
simple cycle. Thus it has two directed cycles and both are face-cycles in the 
only combinatorial embedding of G. So the ILP consists of two variables, 
both of which must be equal to one. 

— i?-node: G is triconnected and according to our definition of combinatorial 
embeddings, every triconnected graph has exactly two embeddings, which 
are mirror-images of each other. When G has m edges and n vertices, we 
have k = 2{m — n + 2) variables and two feasible solutions. The constraints 
are given by the convex hull of the two points in fc-dimensional space that 
correspond to the solutions. 

— P-node: The ILP for graphs whose only inner node in the SPQR-tree is a P- 
node is described in detail in Section 0 because this is where the connection 
to the asymmetric travelings salesman problem (ATSP) is used. 

The proof of correctness differs from the proof given in m and more detailed 
in E! only in the treatment of the skeletons of P-nodes, so we only look at the 
treatment of P-node skeletons in more detail in the next section. 

4 P-Node Skeletons and the ATSP 

A P-node skeleton P consists of two vertices Va and Vb connected by A: > 3 edges 
and has therefore (fc — 1)! different combinatorial embeddings. Every directed 
cycle in P is a face cycle in at least one embedding of P, so we need — fc 
variables in an ILP description of all combinatorial embeddings. 

Let G be the set of variables in the ILP and c : G ^ K a weight function on G. 
We consider the problem of finding the embedding of P that minimizes the sum 
of the weights of the cycles that are face cycles. We will show that this problem 
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can be stated as the problem of finding a Hamiltonian cycle with minimum 
weight in the complete directed graph B = {V,E) with k vertices or simply 
as the asymmetric traveling salesman problem (ATSP). The transformation we 
will show here also works in the opposite direction thus showing NP-hardness 
of optimizing over all embeddings. But we show here the reduction to ATSP 
because this is what our algorithm does when it computes the ILP. 

The complete directed graph B has one vertex for every edge of P. Let ei 
and 62 be two edges in P and vi and V2 the corresponding vertices in B. Then 
the edge (vi,V2) corresponds to the cycle in P that traverses edge ei from Va to 
Vb and edge 62 from Vb to Va- The edge (t’2,^'i) corresponds to the same cycle 
in the opposite direction. In this way, we define a bijection b : C E from the 
cycles in P to the edges in B. This bijection defines a linear function c' : if ^ K 
on the edges of B such that c'(e) = c{b~^{e)). 

Now it is not hard to see that each embedding of P corresponds to a Hamilto- 
nian cycle in B and vice versa. If the Hamiltonian cycle is given by the sequence 
(t>i , U2, . . . , Ufe) of vertices, then the sequence of the edges in P in the correspond- 
ing embedding in counter-clockwise order around Va is (ei, 62, ... , 6fc). Figure El 
shows an example of a P-node skeleton with four edges and the corresponding 
ATSP-graph. The embedding of the P-node skeleton on the left corresponds 
to the Hamiltonian cycle marked in the ATSP-graph. The marked edges corre- 
spond to the face cycles in the embedding of the P-node skeleton. The sequence 
of the edges in P in counter-clockwise order around vertex Va corresponds to the 
sequence of the vertices in the Hamiltonian cycle of the ATSP-graph. 




Fig. 3. A P-node skeleton and its corresponding ATSP-graph 



The sum of the weights of all edges on the Hamiltonian cycle is equal to the 
sum of the weights of the cycles of P that are face cycles in this embedding. So 
finding an embedding of P that minimizes the sum of the weights of the face 
cycles is equivalent to finding a traveling salesman tour with minimum weight in 
B. Since we can easily construct a corresponding P-node embedding problem for 
any ATSP-problem, we have a simple proof that optimizing over all embeddings 
of a graph is in general NP-hard. 

It also enables us to use the same ILP used for ATSP for the ILP that 
describes all combinatorial embeddings of a P-node skeleton. The formulation 
for the ATSP ILP found in [ 7 | has two types of constraints. 
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1. The degree eonstraints state that each vertex must have exactly one incoming 
edge and one outgoing edge in every solution. 

2. The subtour elimination eonstraints state that the number of edges with 
both endpoints in a nonempty subset S of the set of all vertices can be at 
most \S\ — 1. 

The number of degree constraints is linear in the number of edges in a P-node 
skeleton, while the number of subtour elimination constraints is exponential. 
Therefore, we define the ILP for a graph whose SPQR-tree has a P-node as 
the only inner node just as the set of degree constraints for the corresponding 
ATSP-problem. 

To cope with the subtour elimination constraints, we store for each P-node 
skeleton the corresponding ATSP-graph. For each edge in the ATSP-graph we 
store the corresponding cycle in the P-node skeleton. During the recursive con- 
struction, we update the set of corresponding cycles for each edge in the ATSP- 
graph, so that we always know the list of cycles represented by an edge in the 
ATSP-graph. This is done in the same way as the construction of the lifted 
constraints in subsection It -.SI 

When the construction of the recursive ILP is finished, we use a mixed integer 
programming solver to find an integer solution. Then we check if any subtour 
elimination constraint is violated. We do this by finding a minimum cut in each 
ATSP-graph. The weight of each edge in the ATSP graph is defined as the sum 
of the values of the variables representing the cycles associated with the edge. 
If the value of this minimum cut is smaller than one, we have found a violated 
subtour elimination constraint and add it to the ILP. As we will show in the 
next section, separation of the subtour elimination constraint is rarely necessary. 

5 Computational Results 

In our computational experiments, we used a benchmark set of 11491 graphs 
collected by the group around G. Di Battista in Rome (P). Since some of these 
graphs are not planar, we first computed a planar subgraph using the algorithm 
from m- Then we made the resulting graphs biconnected while preserving pla- 
narity using the algorithm from 0 . For the resulting graphs, we first computed 
the recursive part of our ILP and then used a branch and cut algorithm with 
CPLEX as integer program solver to optimize randomly chosen linear functions 
over the set of all combinatorial embeddings of the graph. After each optimiza- 
tion phase, we checked if there was a violated subtour elimination constraint and 
if this was the case, we added the found constraint and re-optimized the ILP. 
Our experiments ran on a Sun Enterprise 10000. 

Figure |4(a)| shows that the number of embeddings can vary greatly for graphs 
with similar size. One graph with 97 vertices and 135 edges had 2,359,296 com- 
binatorial embeddings while another one with 100 vertices and 131 edges had 
only 64 embeddings (note that the y-axis in the figure is logarithmic). 

Figure p:(b)| shows that the number of constraints and variables of our formu- 
lation grows roughly linear with the size of the graphs which is surprising when 



Computing Optimal Embeddings for Planar Graphs 103 



1e+07 

1e+06 

100000 

10000 

1000 

100 

10 




2000 

1800 

1600 

1400 

1200 

1000 

800 

600 

400 

200 







(a) (b) 

Fig. 4. The number of embeddings of the tested graphs and the number of 
variables and constraints 



we consider the growth of the number of embeddings. Our ILP always has more 
constraints than variables. Figure ^(a)| shows the time needed for building the 
recursive ILP and the time needed for optimization including separation. Opti- 
mization is very fast and the longest time we needed was 0.75 seconds. The time 
needed for building the ILP grows sub-exponential with the number of vertices 
and never exceeded 11 seconds. 

Figure |5(b)| shows that separation was rarely necessary and in the cases 
where we separated constraints, the number of the separation steps was small. 
The boxes show the number of graphs that needed 0, 1, 2 or 3 separation steps, 
e.g. 1,2,3 or 4 optimization rounds (note that the y-axis is logarithmic). We 
needed at most three separation steps and this was only the case for one of the 
11,491 graphs. For 11,472 of the graphs, no re-optimization was necessary. 

Our future goal will be to extend our formulation such that each solution 
will correspond to an orthogonal representation of the graph. This will enable 
us to find drawings with the minimum number of bends over all embeddings. Of 
course, this will make the solution of the ILP much more difficult. 

Acknowledgments We thank the group of G. Di Battista in Rome for giving 
us the opportunity to use their implementation of SPQR-trees in GDToolkit, a 
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Abstract. This paper presents polynomial-time approximation algo- 
rithms for the problem of computing a maximum independent set in 
a given map graph G with or without weights on its vertices. If G is 
given together with a map, then a ratio of 1-1-5 can be achieved in O(n^) 
time for any given constant 5 > 0, no matter whether each vertex of G is 
given a weight or not. In case G is given without a map, a ratio of 4 can 
be achieved in 0{rJ) time if no vertex is given a weight, while a ratio 
of O(logn) can be achieved in 0{n7 log n) time otherwise. Behind the 
design of our algorithms are several fundamental results for map graphs; 
these results can be used to design good approximation algorithms for 
coloring and vertex cover in map graphs, and may Hnd applications to 
other problems on map graphs as well. 



1 Introduction 

An independent set in a graph G = (V, E) is a subset I oi V such that no two 
vertices of I are connected by an edge in E. If each vertex of G is associated with 
a weight, the weight of an independent set / in G is the total weight of vertices 
in /. Given a vertex-weighted graph G = (V,E), the maximum independent set 
(MIS) problem requires the computation of an independent set of maximum 
weight in G. The special case where each vertex of G has weight 1, is called the 
unweighted maximum independent set (UMIS) problem. The MIS problem is a 
fundamental problem in many areas; unfortunately, even the UMIS problem is 
widely known to be NP-hard. 

Since the MIS problem is NP-hard, we are interested in designing approxi- 
mation algorithms, i.e. polynomial-time algorithms that find an independent set 
of large but not necessarily maximum weight in a given graph. An approxima- 
tion algorithm A achieves a ratio of p if for every vertex-weighted graph G, the 
independent set / found by A on input G has weight at least where 

OPT(G) is the maximum weight of an independent set in G. 

Hastad jOj gave strong evidence that no approximation algorithm for the 
UMIS problem achieves a ratio of n^~'^ for any e > 0, where n is the number 
of vertices in the input graph. Due to this, a lot of work has been devoted to 
designing approximation algorithms for the MIS problem restricted to certain 

D.-Z. Du et al. (Eds.): COCOON 2000, LNCS 1858, pp. lO.^j- TTPri 2000. 

(c) Springer- Verlag Berlin Heidelberg 2000 
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classes of graphs (such as planar graphs, bounded-degree graphs, etc) . This paper 
considers the MIS problem restricted to map graphs, which were introduced by 
Chen et al. j^l recently. 

Intuitively speaking, a map is a plane graph whose faces are classified into 
nations and lakes. Traditional planarity says that two nations on a map M. are 
adjacent if and only if they share at least a borderline. Motivated by topological 
inference, Chen et al. considered a modification of traditional planarity, which 
says that two nations on M. are adjacent if and only if they share at least a 
border-point. Consider the simple graph G whose vertices are the nations on Ai 
and whose edges are all {/i, / 2 } such that /i and /2 are nations sharing at least 
a border-point. G is called a map graph prmj : if for some integer k, no point in 
A4 is shared by more than k nations, G is called a k-map graph jS|. The UMIS 
problem restricted to map graphs is NP-hard, because planar graphs are exactly 
3-map graphs |S| and the UMIS problem restricted to planar graphs is NP-hard. 

It is known m that the MIS problem restricted to planar graphs has a 
polynomial-time approximation scheme (PTAS). Recall that a PTAS for a maxi- 
mization (respectively, minimization) problem 7T is a polynomial-time algorithm 
A such that given an instance / of iT and an error parameter e > Q, A computes 
a solution S' of / in time polynomial in the size of / such that the value of S is 
at least (1 — e) • OPT {I) (respectively, at most (1 -|- e) • OPT{I)), where OPT (I) 
is the optimal value of a solution of I. An interesting question is to ask whether 
the MIS problem restricted to map graphs also has a PTAS. Regarding this 
question, we show that if the input graph is given together with a map, then 
the MIS problem restricted to map graphs has a practical PTAS. Combining 
this result and the Thorup algorithm uni for constructing a map from a given 
map graph, we obtain a PTAS for the MIS problem restricted to map graphs. 
Unfortunately, the Thorup algorithm is extremely complicated and the exponent 
of its polynomial time bound is about 120. 

So, it is natural to assume that no map is given as part of the input and no 
map construction is allowed in approximation algorithms. Under this assump- 
tion, we first show that the MIS problem restricted to A:-map graphs for any 
fixed integer k has a PTAS. This PTAS runs in linear time, but is impractical 
because it uses the impractical Bodlaender algorithm |2j for tree-decomposition. 
Using this PTAS, we then obtain an 0(n^ )-time approximation algorithm for the 
UMIS problem restricted to general map graphs that achieves a ratio of 4. We 
also give two relatively more practical approximation algorithms. One of them is 
for the MIS problem restricted to general map graphs, runs in 0(n^ log n) time, 
and achieves a ratio of O(logn). The other is for the UMIS problem restricted 
to general map graphs, runs in O(n^) time, and achieves a ratio of 5. 

Behind the design of our algorithms are several fundamental results for map 
graphs; these results can be used to obtain the following results. First, there is 
a PTAS for the minimum weighted vertex cover (MWVC) problem restricted to 
map graphs. Second, a map graph G with n vertices and m edges can be colored 
in 0{n^m) time using at most 2 • x(G) colors, where x(G) is the chromatic 
number of G. Third, for all optimization problems 77 such that 77 restricted 
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to planar graphs has a PTAS as shown in ^ restricted to /c-map graphs 
has a PTAS as well for any fixed integer k. These results do not involve map 
constructions. They may find applications to other problems on map graphs as 
well. 

The rest of this paper is organized as follows. Section E| gives precise defi- 
nitions and states known results that are needed later on. Section E| presents a 
PTAS for the MIS problem where the input graph is given together with a map. 
Section 0 describes two PTASs for the MIS problem restricted to fc-map graphs. 
Section 0 presents nontrivial approximation algorithms for the MIS and UMIS 
problems restricted to map graphs. Section asks several open questions. 

See http://rnc2.r. dendai.ac.jp/~chen/papers/mapis.ps.gz, for a full version 
of this paper. 

2 Preliminaries 

Throughout this paper, a graph may have multiple edges but no loops, while a 
simple graph has neither multiple edges nor loops. Let G = (V, E) be a graph. 
The subgraph of G induced by a subset C7 of M is denoted by G[U]. An indepen- 
dent set of G is a subset U of V such that G[U] has no edge. The independence 
number of G, denoted by a{G), is the maximum number of vertices in an in- 
dependent set of G. Let v G V. The degree of u in G is the number of edges 
incident to v in G; v is an isolated vertex if its degree is 0. The neighborhood 
of V in G, denoted by Nc{v), is the set of vertices adjacent to v in G. Note 
that |AfG(n)| bounds the degree of u in G from below. For a subset U of V, let 
Ng(U) = idy^uNGiv)- 

Let f be a plane graph. Let / be a face of £. The boundary of / is the 
graph {Vf,Ef), where Vf and Ef are the set of vertices and edges surrounding 
/, respectively. The size of /, denoted by |/|, is \Vf\. We call / a cycle face if its 
boundary is a simple cycle. 

A map Ad is a pair where (1) £ is a plane graph {V,Es) and each 

connected component of £ is biconnected, and (2) is a function that maps 
each inner face / of £ to 0 or 1 in such a way that whenever <p{f) = 1, / is 
a cycle face. A face f of £ is called a nation on Ad if / is an inner face with 
ip{f) = 1, while called a lake on Ad otherwise. For an integer k, a k-point in Ad 
is a, p G V that appears on the boundaries of exactly k nations on Ad; a k~^ -point 
in Ad is a p G P that appears on the boundaries of more than k nations on Ad . 
Ad is a k-map if there is no fc'''-point in Ad. Two nations are adjacent if their 
boundaries share a p G V. The graph of Ad is the simple graph G = (V,Eg) 
where V consists of the nations on Ad and Eg consists of all {fi, f 2 } such that 
/i and /2 are adjacent. We call G a map graph, while call G a k-map graph if Ad 
is a A:-map. For clarity, we call the elements of V points, while call the elements 
of V vertices. A nation / on Ad is isolated if / is an isolated vertex of G. 

The half square of a bipartite graph H = (X,Y; Eh) is the simple graph 
G = {X,E) where E = {{xi,X 2 } \ there is a y G Y with {xi,y} G Eh and 
{x2,y} G Eh}. 
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Lemma 1. jS] Every map graph G = (V,E) is the half square of a bipartite 
planar graph H = {V,V]Eh) with \V\ < 3|y| — 6. 

For a map Ai = {£, (f) and a face / in £, we say that / touches a point p if 
p appears on the boundary of /. 



3 A PTAS for the Case with a Given Map 

Let G = {V,E) be the input map graph given with a map M = {{V,E£),(p) and 
a nonnegative weight to each nation on At. Let e be the given error parameter. 

Let H be the bipartite planar graph (V,V;Eh) where Eh = {{v,p} \ v G 
V,p € V, and p is a point on the boundary of nation ?;}. Obviously, E[ can be 
constructed in linear time from Ad. Moreover, G is the half square of E[. Indeed, 
Lemma ^ is proved by further removing all redundant points from H where a 
point p G V is redundant if for every pair {m, r;} of neighbors of p in El, there is 
a point q G V — {p} with {u,u} C Nniq)- 

Our PTAS is a nontrivial application of the Baker approach. Let h = [i] . We 
may assume that G and hence H are connected. Starting at an arbitrary vertex 
r G V, we perform a breadth-first search (BFS) on H to obtain a BFS tree Th- 
For each vertex v in H, we define the level number i(v) of v to be the number of 
edges on the path from r to f in Th- Note that only r has level number 0. For 
each i G {0, . . . , /i — 1}, let be the subgraph of H obtained from H by deleting 
all vertices v with £{v) = 2i (mod 2h). Obviously, each v G V appears in exactly 
h — 1 of Hq through Hh-i - Moreover, Hi is clearly a {2h— l)-outerplanar graph. 

For each i G {0, . . . , /i — 1}, let Gi be the half square of Hi. The crux is 
that each Gi is an induced subgraph of G, because all p G 7^ appear in Hi. Our 
strategy is to compute an independent set li of maximum weight in Gi for each 
i G {0,. . . ,h — 1}. Since each vertex of G appears in all but one of Go through 
Gh-i, the independent set of maximum weight among Iq, . . . , Ih-i has weight at 
least (1— ■OPT{G) > (1 — e) ■OPT{G). Recall that OPT{G) is the maximum 
weight of an independent set in G. 

So, it remains to show how to compute a maximum weight independent set A 
in Gi for each i G {0,...,/i— Ij.Todo this, we first recall the notion of treewidth. 
A tree- decomposition of a graph K = {Vk, Ek) is a pair {{Xj \ j G J}, T), where 
{Xj I j G J} is a family of subsets of Vk and T is a tree with vertex-set J such 
that the following hold: 

— iljejXj = Vk- 

— For every edge {u, w} G Ek, there is a j G J with {ri, w} C Xj. 

— For all ji , j 2 , j 3 G J, if j 2 lies on the path from ji to j '3 in T, then Xj, n Xj^ C 

Each Xj is called the hag of vertex j. The treewidth of a tree-decomposition 
{{Xj I j G J},P) is max{|Aj| — 1 | j S J}. The treewidth of K is the min- 
imum treewidth of a tree-decomposition of K, taken over all possible tree- 
decompositions of K. 
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It is widely known that the MIS problem restricted to graphs with bounded 
treewidth can be solved in linear time. However, the treewidths of our graphs 
Go, . - . , Gh-i may not be bounded. Our strategy is to resort to Hq, . . . , Hh-i 
instead, and do a dynamic programming on them. The result is the following: 

Theorem 1. There is a PTAS for the special case of the MIS problem restricted 
to map graphs where the input graph is given together with a map Ai = 

It runs in 0{h\£\^ + time, where h = \-~\ and € is the given error 

parameter. 

4 PTASs for fc-Map Graphs 

We use an extension of the Baker approach, suggested in The only difference 
between the extension and the Baker approach is that instead of decomposing the 
given graph G into /c'-outerplanar subgraphs, we decompose G into subgraphs 
of bounded treewidth such that each vertex of G appears in all but one of these 
subgraphs. The next lemma shows that such a decomposition is possible for a 
fc-map graph. 

Lemma 2. Given a k-map graph G and an integer h > 2, it takes 0{h\G\) time 
to compute a list C of h induced subgraphs of G such that (1) each subgraph in 
C is of treewidth < k{6h — 4) and (2) each vertex of G appears in exactly h — 1 
of the subgraphs in C. 

By the proof of Lemma 0 for all optimization problems U such that U 
restricted to planar graphs has a PTAS as shown in P, we can emulate Baker 
P to show that 77 restricted to 7-map graphs has a PTAS as well for any fixed 
integer k. In particular, we have: 

Theorem 2. For any fixed integer k, there is a PTAS for the MIS problem 
restricted to k-map graphs. It runs in time, where n is the 

size of the given k-map graph, h = |"f ] , and e is the given error parameter. 

The PTAS in Theorem 0 is impractical. We next show a different PTAS for 
the special case of the UMIS problem where the input graph is given together 
with a 7-map. This PTAS is based on the Lipton and Tarjan approach. Although 
this PTAS is not better than the one in Sectional the tools used in its design 
seem to be fundamental. Indeed, the tools will be used in the next section, and 
we believe that they can be used in other applications. 

A map A4 — (S,^) is well-formed if (1) the degree of each point in £ is at 
least 3, (2) no edge of £ is shared by two lakes, (3) no lake touches a point whose 
degree in £ is 4 or more, and (4) the boundary of each lake contains at least 4 
edges. 

The next lemma shows that a map can easily be made well-formed. 

Lemma 3. Given a map A4 = (£, (p) with at least three nations and without 
isolated nations, it takes 0(|£|) time to obtain a well-formed map A4' such that 
(1) the graph of Ai' is the same as that of AA and (2) for every integer k >2, 
A4 is a k-map if and only if so is AA' . 
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Let Ai = (£, (p) he a, well- formed map. Then, each point on the boundary of 
a lake on is a 2-point in Ai and has degree 3 in £; moreover, for all integers 
i > 3, each i-point in A4 has degree i in £. Using these facts, we can prove the 
following lemma which improves a result in ^ significantly. 

Lemma 4. Let k > 3 and G be a k-map graph with n > 3 vertices and m edges. 
Then, m < kn — 2k. 

From Lemma 0, we can prove the following corollary. 

Corollary 1. The following hold: 

1. Given an integer k and a k-map graph G, it takes 0(|G|) time to color G 
with at most 2k colors. Consequently, given a k-map Ai = {£,Lp), it takes 
0(k\£\) time to color the graph of Ai with at most 2k colors. 

2. Given a map graph G = (V,E), we can color G in 0(|Up • |if|) time using 
at most 2 ■ x(G) colors, where x(G) is the chromatic number of G. 

Indeed it has been known for a long time that fc-map graphs are 2fc-colorable 
P], but Corollary [I] has the advantage of being algorithmic. It is also known that 
A:-map graphs with k >8 can be colored with at most 2k — 3 colors |3], but the 
proof does not yield an efficient algorithm. 

Let G be a connected graph with nonnegative weights on its vertices. The 
weight of a subgraph of G is the total weight of vertices in the subgraph. A 
separator of G is a subgraph H oi G such that deleting the vertices of H from 
G leaves a graph whose connected components each have weight < where 
W is the weight of G. The size of a separator H is the number of vertices in H. 
The following lemma shows that fc-map graphs have small separators. 

Lemma 5. Given a k-map Ai = {£, <p) on which each nation has an associated 
nonnegative weight and at least two nations exist, it takes 0{\£\) time to find a 
separator H of the graph G of Ai such that the size of H is at most 2yj2k{n — 1). 

Using LemmaEl we can emulate Lipton and Tarjan Pj to prove the following 
theorem: 

Theorem 3. For any fixed integer k, there is a PTAS for the special case of the 
UMIS problem restricted to k-map graphs where the input ^raph is given together 
with a k-map Ai = {£,<p). It runs in 0(|£|log|£| -|- ^ n) time, where n is 
the number of nations on Ai, h = [i], and e is the given error parameter. 

5 Approximation Algorithms for Map Graphs 

This section presents four approximation algorithms. The first three are for the 
UMIS problem restricted to map graphs. The fourth is for the MIS problem 
restricted to map graphs. The input to the first algorithm is a map instead of 
a map graph; this algorithm is given here for the purpose of helping the reader 
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understand the other three algorithms. Also, the second algorithm is relatively 
more practical than the third. 

For a well- formed map A4 = (£,(f), let G(M) be the plane graph obtained 
from the dual plane graph £* of 8 by deleting all vertices that are lakes on M . . 
Note that corresponding to each 2“'"-point p in 8, there is a unique face !Fp in 
G{M) such that the vertices on the boundary of J^p are exactly the nations on 
M that touch p; the boundary of Tp is a simple cycle. We call Tp a point- face 
of G(A4). Those faces of G{M) that are not point-faces are called lake- faces. 
Consider the following algorithm, called MapISp. 

Input : A map M. = (£, <p). 

1. If at most two nations are on At, then return a maximum weight independent 
set of the graph of M. . 

2. If there is an isolated nation on At, then find such a nation /; otherwise, 
perform the following substeps: 

2.1. Modify Ad to a well-formed map, without altering the graph of Ad. 

2.2. Construct Q{M). 

2.3. Find a minimum-degree vertex / in G{M). 

3. Construct a new map Ad' = {8, p'), where for all faces f\ of 8, if /i = / or /i 
is a neighbor of / in the graph of Al, then <p'(/i) = 0, while <p'{fi) = p(/i) 
otherwise. 

4. Recursively call MapISi on input Ad', to compute an independent set J in 
the graph of Ad'. 

5. Return {/} U J. 



Theorem 4. MapISi achieves a ratio of 5 and runs in 0(|£p) time. 

Proof. (Sketch) The number of edges in G(A4) is at most 3n — 6. So, the degree 
of / in C/(Ad) is at most 5. We can then claim that the neighborhood of / in G 
can be divided into at most five cliques. 

Consider the following algorithm, called MapIS 2 - 

Input : A map graph G = {V, E). 

1. If G has exactly at most two vertices, then return a maximum weight inde- 
pendent set of G. 

2. Find a vertex f in G such that a(G/) < 5, where Gj = G[Ag(/)]. 

3. Remove / and its neighbors from G. 

4. Recursively call MapIS 2 on input G, to compute an independent set J . 

5. Return {/} U J. 



Theorem 5. MapIS 2 achieves a ratio of 5, and runs in \Nc{f)\^) 

time. 

Proof. By the proof of Theorem 0 Step 2 of MapIS 2 can always be done. 
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Consider the following algorithm, called MapIS^\ 

Input : A map graph G = (V, E), an integer A: > 4, and a positive real number 
e less than 1. 

1 . If G has exactly at most two vertices, then return a maximum weight inde- 
pendent set of G. 

2. Find a vertex f in G such that a(Gf) is minimized among all vertices / of 

G, where G/ = G[1 Vg(/)]. 

3 . If a{Gf) < 4, then perform the following substeps: 

3 . 1 . Remove / and its neighbors from G. 

3 . 2 . Recursively call MapISs on input G to compute an independent set J. 

3 . 3 . Return {/} U J. 

4 . Repeat the following substeps until G has no clique of size > A: -|- 1 (and hence 
is a A:-map graph): 

4 . 1 . Find a maximal clique G in G such that G contains at least A; -|- 1 
vertices. 

4 . 2 . Remove all vertices of G from G. 

5. Color G with at most 2k colors (cf. Corollary^; let Ji be the maximum set 
of vertices with the same color. 

6. Compute an independent set J 2 in G with | J 2 I > (1 — e)a{G). 

7. Return the bigger one among Ji and J^- 

To analyze MapISs, we need the following lemma: 

Lemma 6. |H] Let G be the graph of a map A4 = (£, p). Let G be a elique of G 
containing at least five vertices. Then, at least one of the following holds: 

— C is a pizza, i.e., there is a point p in £ such that all f € G touch p. 

— C is a pizza- with-crust, i.e., there are a point p in £ and an fi G G such 
that (1) all f G C — {/i} touch p and (2) for each f G C — {fi}, some point 
q ^ p in £ is touched by both fi and f . 

— C is a hamantasch, i.e., there are three points in £ such that each f G C 
touches at least two of the points and each point is touched by at least two 
nations. 



Theorem 6. If we fix k = 28 and e = 0.25, then MapISs achieves a ratio of 4 
and runs in 0{\V\ ■ ' 1^1) Aime. 

Proof. Let Go be the input graph. We may assume that Go is connected. By the 
proof of Theorem0 it suffices to prove that if a(G/) > 5, then Ji found in Step 
5 or J 2 found in Step 6 has size > . So, suppose that a(G/) > 5. 

Let n be the number of vertices in Gq. Fix a well- formed map M of Gq. 
For each integer i > k, let hi be the number of cliques G in Go such that 
(1) G contains exactly i vertices and (2) G is found in Substep 4.1 by the 
algorithm. We claim that the number x of edges in Q(A4) is bounded from 
above by 3n — 6 — ~ 4). To see this claim, let T\,...,Tt be the 

point-faces in Q(A4) whose boundaries each contain at least three vertices. For 
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each j G {1, ... ,i}, fix a vertex fj on the boundary of Tj. We can triangulate 
each face J-j by adding exactly — new edges incident to fj; call these new 
edges imaginary edges incident to fj. There are exactly \Tj\ — 'i imaginary edges 
incident to fj. Thus, x < 3n — 6 — \ ~ 3)- Let C be a clique of Go 

found in Substep 4.1. We say that an imaginary edge {fj, f'} with j G {1, . . . , t} 
is eontrihuted by C, if f' is contained in C. Let i be the number of vertices in 
C. To prove the claim, it remains to prove that the number of imaginary edges 
contributed by C is at least f — 4. By Lemma 1 only the following three cases 
can occur: 



— Case 1: C IS & pizza. Then, there is a point p in A4 such that all nations f 
in C touch p on Ad. Let Tj be the point-face in Q{M) that corresponds to p. 
No matter whether fj is in C or not, at least * — 3 imaginary edges incident 
to fj are contributed by C. 

— Case 2: C is a pizza- with-crust. Then, there is a point p in Ad such that 
\C\ — 1 nations in C touch p on Ad. Similarly to Case 1, at least (z — 1) — 3 
imaginary edges are contributed by C. 

— Case 3: C is a hamantasch. Then, there are three points pi through po in Ad 
such that each nation f'GC touches at least two of the three points on Ad 
and each point is touched by at least two nations in C. Since z > A:-|-l > 5, at 
most one of pi through po is a 2-point in Ad. Moreover, if one of pi through 
P3 is a 2-point, then C is actually a pizza-with-crust and Case 2 applies. So, 
we may assume that pi through po are 2"''-points in Ad. Let be the point- 
face in C/(Ad) that corresponds to pi, and ai be the number of nations in C 
that touch Pi on Ad. Define Ayj, 02, and 03 similarly. Then, the number 
of imaginary edges incident to /^y (respectively, fj^, or fj,^) contributed by 
C is at least oi — 3 (respectively, 02 — 3, or 03 — 3). Thus, the number of 
imaginary edges contributed by C is at least (01-1-02-1-03) — 9 > 2z — 9 > z — 4. 



By the claim, re < 3zz — 6 — J2^k+i ~ ^). The degree of / in Q{M) is 
at most [^J. On the other hand, since ct{Gf) > 5, the degree of / in Q{A4) 



is at least 5. Thus, 2X)“fe+i hi{i - 4) < n. In turn, X^^fe+i ^ 2(fc-3) 
J2i=k+1 — 



and 



^k+l "■* — 2(fe-3) ■ 

Let J be the bigger one among J\ and J2. By Step 6, | J| > (1 — e)(a(Go) — 
Yl’tLk+i^i)- Ly Step 5 and Corollary [IJ \J\ > ^ ■ (n - Thus, 



|J| 

a{Go) 






2(fe-3) a{Go) 



) and 



Cl 



> 



fc-7 



r. As the value of 



a(Go) - 4fc(fc-3) a(Go)- 

iTL increases while the second lower 



' decreases, the first lower bound on , 

a(Go) ’ a(Go) 

bound on decreases. The worst case happens when ■ 



Therefore, 



OI 



OI 

OGo) 



> (1 — e) • 3fc_^2fce~7 • fixing e = \ and k = 28, we have 






OGo) - 4 



Finally, the proof of Theorem^ leads to an approximation algorithm for the 
MIS problem restricted to map graphs that achieves a logarithmic ratio. 
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Theorem 7. Given a vertex-weighted map graph G = (V,E), we can find an 
independent set of G whose weight is at least g j°| |‘^y| • OPT{G), in 0{{J2fev 
WcifW + IC'D • log |y|) time. 

6 Concluding Remarks 

The impracticalness of the PTAS in Theorem 0 and hence that of MapIS^ 
mainly result from the impracticalness of the Bodlaender algorithm 0 for tree- 
decomposition. A more practical algorithm for tree-decomposition can make our 
algorithms more practical. 

Is there a practical PTAS for the MIS or UMIS problem restricted to map 
graphs? It would also be nice if one can design approximation algorithms for 
the problems that achieve better ratios than ours. Can MapISi and MapIS 2 
be implemented to run faster than as claimed? 

Finally, we point out that Lemma can be used to design a PTAS for the 
minimum weighted vertex cover (MWVC) problem restricted to map graphs. 

Theorem 8. There is a PTAS for the MWVG problem restricted to map graphs. 
It runs in 0{n^) time. 

Acknowledgments: I thank Professors Seinosuke Toda and Xin He for 
helpful discussions, and a referee for helpful comments on presentation. 
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Abstract. Given a set V and three relations ixi,;, ixi^ and cOi, we wish 
to ask whether it is possible to draw the elements v £V each as a closed 
disc homeomorph in the plane in such a way that (1) and U™ 
are disjoint for every (v,w) Gtxid, (2) and Um have disjoint interiors 
but share a point of their boundaries for every {v,w) Gixim, and (3) T>v 
includes T>n, as a sub-region for every (v,w) €ixii. This problem arises 
from the study in geographic information systems. The problem is in NP 
but not known to be NP-hard or polynomial-time solvable. This paper 
shows that a nontrivial special case of the problem can be solved in 
almost linear time. 



1 Introduction 



1.1 Topological Inference Problems 

An instance of the planar topological inference (PTI) problem is a triple (V,t><id 
,[xim), where U is a finite set, (subscript d stands for disjoint) and txim 
(subscript m stands for meet) are two irreflexive symmetric relations on V with 
ixid n txim= 0- The problem is to determine whether we can draw the elements 
r; of U in the plane each as a closed disc homeomorph T>y in such a way that 
(1) T>y and Vy, are disjoint for every (v,w) Sixi^;, and (2) and T>yj have 
disjoint interiors but share a point of their boundaries for every {v,w) GtOm- 
The PTI problem and its extensions have been studied for geographic infor- 
mation systems p2l,5ftif7psi9j . In the fully- conjunctive case of the PTI problem, 
it holds that for every pair of distinct v,w G V, either {(t>, w), (tc, u)} 
or {(t>, w), (w, ?;)} The fully-conjunctive PTI problem was first studied 

by Chen et al. and subsequently by Thorup m-, the latter gave a compli- 
cated inefficient but polynomial-time algorithm for the problem. Motivated by 
the maps of contemporary countries in the world, Chen et al. restricts the 
fully-conjunctive PTI problem by requiring that no point of the plane is shared 
by more than k closed disc homeomorphs T>y with v G V. Call this restriction the 
fully-conjunctive /c-PTI problem. The fully-conjunctive 3-PTI problem is just the 
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problem of deciding whether a given graph is planar. However, even the fully- 
conjunctive 4-PTI problem is difficult to solve; a very complicated 0(n^)-time 
algorithm for it was given in 

As has been suggested previously | llhttil/idj . a generalization of the PTI 
problem is obtained by adding another irreflexive, antisymmetric and transitive 
relation cxi^ (subscript i stands for inclusion) on V with ixi^ n([xi(i U txi^) = 0 and 
requiring that T>y includes T>^ as a sub-region for all (v,w) Gtxii. Unfortunately, 
no meaningful result has been obtained for this generalization. A natural restric- 
tion on is to require that each Vy with v G V and {(r;, w) GOOi | w G U} 0 
is the union of all T>y, with (v,w) Sixi^. We call this generalization the hierar- 
chical topological inference (HTI) problem. In the fully -conjunctive case of the 
HTI problem, for every pair of distinct v,w G V, exactly one of the following 
(1) through (3) holds: (1) {(u, w), (w, u)} Cixi^; (2) {(u, w), (w, u)} (3) 

{(u, w), (w, u)} n ([Xd U Xm) = 0 and |{(u, w), (w, u)}n x^ | = 1. 

Since no efficient algorithm for the fully-conjunctive HTI problem is in sight, 
we seek to restrict it in a meaningful way so that an efficient algorithm can 
be designed. As Chen et al. restricted the fully-conjunctive PTI problem, 
we restrict the fully-conjunctive HTI problem by requiring that no point of the 
plane is shared by more than k minimal closed disc homeomorphs T>y, i.e., those 
T>y with V gV such that {w G U | (v,w) Gx^} = 0. Call this restriction the 
fully-conjunctive A:-HTI problem. Unlike the fully-conjunctive 3-PTI problem, 
the fully-conjunctive 3-HTI problem is nontrivial. 



1.2 A Graph-Theoretic Formulation 

Throughout this paper, a graph may have multiple edges but no loops, while a 
simple graph has neither multiple edges nor loops. Also, a cycle in a graph H 
always means a vertex-simple cycle in H . 

A hierarchical map At is a pair {£,T), where (1) £ is a plane graph whose 
connected components are biconnected, and (2) IF is a rooted forest whose leaves 
are distinct faces of 8. The faces of 8 that are not leaves of IF are called the 
lakes on At, while the rest faces of 8 are called the leaf districts on At. For each 
non-leaf vertex a of A, the union of the faces that are leaves in the subtree of T 
rooted at a is called a non-leaf district on Al. Al is a planar map if each vertex 
of 8 is incident to at most three edges of £. Ad is a disc map if (1) the boundary 
of each leaf district on Ad is a cycle and (2) for every non-leaf district on Ad, 
there is a cycle C such that V is the union of the faces in the interior of C0. 
The map graph G of Ad is the simple graph whose vertices are the leaf districts 
on Ad and whose edges are those {/i, / 2 } such that the boundaries of fi and /2 
intersect. Note that two districts on a map Ad may share two or more disjoint 
borderlines (e.g. China and Russia share two disjoint borderlines with Mongolia 
in between). However, the map graph G of Ad is always simple, meaning that 
there is at most one edge between each pair of vertices. Note that G is planar 
when Ad is a planar map. We call (G, A) the abstract of Ad. 

^ In topology, is a closed disc homeomorph. 
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A graph-forest pair is a pair of a simple graph G and a rooted forest T whose 
leaves are exactly the vertices of G. In the study of geographic information 
systems, we are given G and T and need to decide whether the graph- forest 
pair (G, IF) is the abstract of a disc map. This problem is equivalent to the 
fully-conjunctive HTI problem. Its special case where each tree in IF is a single 
vertex is equivalent to the fully-conjunctive PTI problem. Moreover, its special 
case where we ask whether a given graph-forest pair (G, IF) is the abstract of a 
planar disc map is equivalent to the fully-conjunctive 3-HTI problem. 

1.3 Our Result and Its Significance 

Our result is an 0(|lF| -|- |G| log |G|)-time algorithm for deciding whether a given 
graph-forest pair (G,T) is the abstract of a planar disc map Ad, where |G| 
denotes the total number of vertices and the edges in G. The difficulty of this 
problem comes from the lack of information in G about whether there are lakes 
on Ad and how many borderlines two leaf districts on Ad can share. 

Since Chen et al.’s algorithm P for the fully-conjunctive 4-PTI problem is 
a reduction to the fully-conjunctive 3-PTI problem, we suspect that our result 
may serve as a basis for tackling the fully-conjunctive 4-HTI problem or even 
more general cases. 

The rest of the paper is organized as follows. § 0 gives definitions needed 
in later sections. Theorem 0 reduces the problem to the special case where the 
input planar graph G is biconnected. For this special case, if we know that two 
vertices z\ and Z 2 of the input planar graph G can be embedded on the outer 
face, then the algorithm in §0works. In §0 we discuss how to identify two such 
vertices Z\,Z 2 - 

See http://rnc2.r. dendai.ac.jp/~chen/papers/3map.ps.gz for a full version. 

2 Definitions 

Let G be a graph. V{G) and E(G) denotes the vertex-set and the edge-set of 
G, respectively. |G| denotes the total number of vertices and edges in G. Let 
E' C E{G) and U C V{G). The subgraph of G spanned by E', denoted by G[E'], 
is the graph {V ,E') where V consists of the endpoints of the edges of if'. G — E' 
denotes the graph (V {G),E{G) — E'). The subgraph of G induced by U is denoted 
by G[U]. G — U denotes G[V{G) — i/]. If G is a plane graph, then each of G[E'], 
G — E' , G\U\, and G — U inherits an embedding from G in the obvious way. 

A cut vertex of G is a u S y{G) such that G — {u} has more connected 
components than G. G is biconnected if G is connected and has no cut vertex. 
A biconnected component of G is a maximal biconnected subgraph of G. We 
simply call connected components 1- components, and biconnected components 
2- components. 

Let G be a plane graph. A k-face of G is a face of G whose boundary is a 
cycle consisting of exactly k edges. A superface of G is G[U] for some U C V(G) 
such that G\U] is connected and each inner face of G\U] is an inner 2- or 3-face 
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of G. The interior of a superface G\U] of G is the open set consisting of all points 
p in the plane that lie neither on the boundary nor in the interior of the outer 
face of G[U]. 

A graph-forest pair (G, iF) is planar if G is planar. A graph-forest-set (GFS) 
triple is a triple (G, T, S) such that (G, IF) is a planar graph-forest pair and S is 
a subset of V{G). For a rooted forest T and a non-leaf vertex / of T, let Lyr(f) 
denote the set of all leaves of T that are descendants of /. An edge- duplicated 
supergraph of a simple graph G is a graph Q such that (1) the vertices of Q are 
those of G and (2) for each pair of adjacent vertices in G has an edge between 
the two vertices. A plane graph Q satisfies a GFS triple (G,T,S) if (1) Q is 
an edge-duplicated supergraph of G and (2) for every non-leaf vertex / of IF, 
GiL^if)] is a superface of G whose interior contains no vertex of S. A GFS triple 
is satisfiable if some plane graph satisfies it. A planar graph- forest pair (G, J-) is 
satisfiable if the GFS triple (G, IF, 0) is satisfiable. 

We can assume that a GFS triple (G,iF,S) (resp., graph-forest pair (G, IF)) 
always satisfies that no vertex of T has exactly one child. By this, \T\ = 0(|G|). 

Theorem 1. The problem of deciding whether a given planar graph-forest pair 
(G, IF) is the abstract of a planar disc map can be reduced in 0(|G|log|G|) 
time, to the problem of deciding whether a given GFS triple (G, IF, S) with G 
biconnected is satisfiable. 



3 Biconnected Case 

Fix a GFS triple (G, IF, S) such that G is biconnected. We want to check if 
(G, IF, S) is satisfiable. To this end, we first check whether G[Lyr(^f)] is connected 
for all vertices / of IF. This checking takes 0(|G| log |G|) time. If (G,1F) fails the 
checking, then we stop immediately. So, suppose that (G, T) passes the checking. 



3.1 SPQR Decompositions 

A two-terminal graph^ (TTG) G = (y,E) is an acyclic plane digraph with a 
pair (s,t) of specified vertices on the outer face such that s is the only source 
(i.e. with no entering edge) and t is the only sink (i.e. with no leaving edge). 
We denote s and t by s{G) and t{Q), respectively. We also call s and t the poles 
of G and the rest vertices the nonpoles of G- A two-terminal subgraph of ^ is a 
subgraph H of Q such that H is a TTG and no edge of G is incident both to a 
nonpole of H and to a vertex outside H. Although ^ is a digraph, the directions 
on its edges are used only in the definition of its two-terminal subgraphs but have 
no meaning anywhere else. With this in mind, we will view Q as an undirected 
graph from now on. 

A split pair of G is either an unordered pair of adjacent vertices, or an un- 
ordered pair of vertices whose removal disconnects the graph (V{G), E{Q) U 
{{s(^), t(C/)}}). A split component of a split pair {m,u} is either an edge {u,u} 

^ Previously called a planar st-graph. 
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or a maximal two-terminal subgraph H (of Q) with poles u and v such that 
{u,v} is not a split pair of H. A split pair {m,u} of Q is maximal if there is no 
other split pair {wi, W 2 } in Q such that {u, u} is contained in a split component 
of {WI,W 2 }. 

The decomposition tree T oiQ describes a recursive decomposition of Q with 
respect to its split pairs |M4| . For clarity, we call the elements of V{T) nodes. 
T is a rooted ordered tree whose nodes are of four types: S, P, Q, and R. Each 
node /i of T has an associated TTG, called the skeleton of p, and denoted by 
Skl(^). Also, it is associated with an edge in the skeleton of the parent x of p, 
called the virtual edge of p in Skl(x). T is recursively defined as follows. 

Trivial case: Q consists of a single edge {s(^),t(^)}. Then, T consists of a 
single Q-node whose skeleton is Q itself. 

Series case: Q is not biconnected. Let ci, . . . , Ck-i {k > 2) be the cut vertices 
of Q. Since ^ is a TTG, each Ci S {ci, . . . , Cfe_i} is contained in exactly two 
2-components Qi and Qi+i of Q such that s(Q) is in and t(Q) is in Qk- The 
root of T is an S-node p. Skl(/i) is a path s(^), ei, ci, 62 , . . . , Ck-i, ek,t{Q). 

Parallel case: s{Q) and t{Q) constitute a split pair of Q with split components 
Gi, ■ ■ ■ ,Gk {k > 2). Then, the root of T is a P-node p. Skl(/r) consists of k parallel 
edges between s(C/) and t{G), denoted by ei, . . . , Cfc. 

Rigid case: none of the above cases applies. Let {si, ti}, . . . , {sfe, tk} be the 
maximal split pairs of G- For i = 1, k, let Gi be the union of all the split 
components of {si,ti}. The root of T is an R-node p. Skl(/r) is obtained from G 
by replacing each Gi G {Gi, ■ ■ ■ , Gk} with an edge e, = {s^, t*}. 

In the last three cases, p has children ... (in this order), such that 
Vi is the root of the decomposition tree of graph Gi for all i G { 1 , . . . , fc}. The 
virtual edge of node Vi is the edge Ci in Skl(^). Gi is called the pertinent graph 
of Vi and is denoted by Prt(t'i). 

By the definition of T, no child of an S-node is an S-node and no child of a 
P-node is a P-node. Also note that, for an R-node p, the two poles of Skl(/r) are 
not adjacent in Skl(^). A block of G is the pertinent graph of a node in T. For a 
non-leaf node pinT and each child v of p, we call Prt(^) a child block of Prt(/r). 

Lemma 1. Let z\ and Z 2 be two vertices of G such that adding a new edge 
between z\ and Z 2 does not distroy the planarity of G. Then, it takes 0(|G|) time 
to convert G to a TTG G with poles zi and Z 2 and to construct the corresponding 
decomposition tree T of G . Moreover, every TTG G' with poles zi and Z 2 such 
that G' satisfies {G,T,S), can be obtained by performing a sequence o/0(|G|) 
following operations: 

1. Flip the skeleton of an R-node of T around the poles of the skeleton. 

2. Permute the children of a P-node of T . 

3. Add a new Q-node child to a P-node p of T such that p originally has a 

Q-node child; further add a virtual edge to the skeleton of p. 

A desired pair for (G, J-, S) is an unordered pair {zi,Z 2 } of two vertices of G 
such that if {G,!F, S) is satisfiable, then it is satisfied by a plane graph in which 
both zi and Z 2 appear on the boundary of the outer face. 
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Our algorithm is based on Lemma D In order to save time, we want to first 
find a desired pair {zi, Z 2 } for (G, IF, S'). If IF is not a tree, then we can let {zi,Z 2 j 
be an edge such that there are two roots /i and /2 in IF with zi € and 

Z 2 G Ljr[f 2 ). If IF is a tree but |S| > 2, then we can let z\ and Z 2 be two arbitrary 
vertices in S. In case IF is a tree and |S| < I, it is not easy to find out Z\ and 
Z 2 ; 0 is devoted to this case. 

3.2 Basic Ideas 

Suppose that we have found a desired pair {zi, Z 2 } for (G, iF, S) and have con- 
verted G to a TTG G with poles zi and Z 2 - Throughout il3.2l 13. 3t and 13. 41 when 
we say that (G, iF, S) is not satisfiable, we mean that (G, iF, S) is satisfied by no 
plane graph in which z\ and Z 2 are on the boundary of the outer face. 

Let T be the decomposition tree of G- Our algorithm processes the P- or 
R-nodes of T in post-order. Processing a node /i is to perform Operations 1 
through 3 in Lemma ^ on fh® subtree of T rooted at /i, and to modify Prt(/i) 
and hence G accordingly. 

Before proceeding to the details of processing nodes, we need several defi- 
nitions. For a block B oi G, G — B denotes the plane subgraph of G obtained 
from G by deleting all vertices in V{B) — {s{B),t{B)} and all edges in E{B). 
For a plane subgraph H of G and a vertex f of fF, H C\ Lyr[f) denotes the plane 
subgraph of H induced by V{H) n Ljr[f). 

Let /i be a node of T. Let B = Prt(/J,). A vertex / in IF is ^-through if B D 
Lyr(f) contains a path between s{B) and t{B)\ is gi- out-through if {G—B)r\Lyr[f) 
contains a path between s{B) and t{B); is ^-critical if it is both /i-through and 
/r-out-through. Note that if a vertex / of IF is /i-critical (resp., /i-through, or 
/i-out-through) , then so is every ancestor of f in J-. A vertex / of iF is extreme 
/i-critical if / is /i-critical but no child of f in fF is. Define extreme /i-through 
vertices and extreme /i-out-through vertices similarly. 

Let vi, . . . ,Vk be the children of /t in T. For each i G {1, . . . , fc}, let be 
the virtual edge of i^i in Skl(/i), and B^ = Prt(iii). For a vertex / of IF, 
denotes the set {d \ f is iii-through}. We say / is fi-cyclic, if the plane subgraph 
of Skl(/i) spanned by E^j contains a cycle. Note that if / is /i-cyclic, then so is 
every ancestor of / in T . We say / is extreme fi-cyclic, if / is /i-cyclic but no 
child of / in IF is. 

Lemma 2. For each node fi ofT, there is at most one extreme ^-critical (resp., 
pi-through, or pi- out-through) vertex in T . 

For each node /i in T, if there is a /t-through (resp., /i-out-through, or pi- 
critical) vertex in fF, then let x-thr(/i) (resp., x-out-thr(/i), or x-crt(/i)) denote the 
extreme /t-through (resp., /t-out-through, or /t-critical) vertex in J-; otherwise, 
let x-thr(/i) = _L (resp., x-out-thr(/t) = _L, or x-crt(/i) = _L). 

Let pi he a node of T. An embedding of Prt(/t) is a TTG that is an edge- 
duplicated supergraph of Prt(/i) and has the same poles as Prt(/t). In an em- 
bedding B of Prt(/i), the poles of B divide the boundary of the outer face of 
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B into two paths, which are called the left side and the right side of B, respec- 
tively. An embedding B of Prt(/i) is potential if for every vertex / in IF with 
V{B) n Lj^{f) 7 ^ 0, each connected component of S n Ajf(/) is a superface of 
B n whose interior contains no vertex of S. When /i is a P-node (resp., 

R-node), a potential embedding B\ of Prt(/i) is better than another potential 
embedding B 2 of Prt(/r) if B\ has more sides consisting of exactly one edge 
(resp., two edges) than B 2 - An embedding of Prt(/r) is optimal if it is potential 
and no potential embedding of Prt(^) is better than it. 

Lemma 3. Suppose that an embedding Q' of G satisfies (G,iF,S). Let pi be a 
node ofT. Let B^ be the subgraph of G' that is an embedding o/Prt(/r). Let 
be an optimal embedding o/Prt(/r). Then, the embedding of G obtained from G' 
by replacing B^ with still satisfies (G,iF,S). 

Right before the processing of a P- or R-node pi, the following invariant holds 
for all V that is a non-S-child of /i or a child of an S-child of ^ in T: 

— An optimal embedding B,^ of Prt(i^) has been fixed, except for a possible 
flipping around its poles. 

In the description of our algorithm, for convenience, we will denote the fixed 
embedding of Prt(i^) still by Prt(i^) for all nodes v oiT that have been processed. 



3.3 Processing R-Nodes 

Let p, be an R-node of T. Let B = Prt(/r) and K = Skl(^). Let vi, . . . ,i^k be the 
children of p in T. For each i G {1, . . . , k}, let Cj be the virtual edge of I'i in K, 
and Bi = Prt(j^i). 

Since p is an R-node, each side of B contains at least two edges. If x-crt(/r) = 
/ yf _L, at least one side of B must be a part of the boundary of an inner 3-face 
of the superface induced by Ljr[f). In this case, if {s{B),t{B)} ^ E{G), then 
(G, IF, S) is not satisflable because no side of B can be part of the boundary of 
a 3-face in an embedding of G- So, we assume that {s{B),t{B)} G E{G) when 
x-crt(/i) yf _L. Processing p is accomplished by performing the following steps: 

R-Procedure: 

1 . Compute the rooted forest T' obtained from T by deleting all vertices that 
are not ^-cyclic. 

2. For all vertices / in T' , perform the following substeps: 

(a) For every inner face E of K[Ef,,j], check if E is also an inner 3-face of 
K; if not, stop immediately because (G, IF, S') is not satisflable. 

(b) For every edge Cj G E^^j that is not on the outer face of K[Enj], check 
if V{Bi) C Ly^{f); if V{Bi) % Lyr(/), then stop immediately because 
(G, T , S) is not satisflable. 
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3. For all roots / in T' , perform the following substeps: 

(a) For every edge G that is not on the outer face of K[E^j], check 
if (1) S n {V{Bi) — {s{B),t{B)}) = 0 and (2) both sides of Bi consist of 
a single edge; if (1) or (2) does not hold, then stop immediately because 
{G, E, S) is not satisfiable. 

(b) For every vertex v oi K that is not on the boundary of the outer face 
of K[E^j], check if u ^ S'; if u G S', then stop immediately because 
(G, IF, S) is not satisfiable. 

(c) For every edge shared by the boundaries of the outer face of K[E^j] 
and some inner face E of K[E^j], check if some side of Bi consists of a 
single edge and has not been labeled with the index of a face other than 
F\ if no such side exists, then stop immediately because (G, E, S) is not 
satisfiable, while label one such side with the index of E otherwise. 

4. For each side P oi K consisting of exactly two edges, and for each edge on 
P, if some side Pi of Bi consists of a single edge and has not been labeled, 
then mark the side Pi of Bi “outer” . 

5. For all child blocks Bj of B, if at least one of the following holds, then modify 
B by hipping Bj around the poles of Bj : 

— A side of Bj is marked “outer” but is not on the boundary of the outer 
face of B. 

— A side of Bj is labeled with the index of some face F oi K but that side 
is not on the boundary of the corresponding face of B. 

6. If x-crt(/i) = /i yf _L, check whether some side of B consists of exactly two 
edges and the common endpoint of the two edges is in Lyr(/j^); if no such 
side of B exists, then stop immediately because {G,E,S) is not satisfiable. 

After /j, is processed without failure, the invariant in t |,S.2l ho]d for /r. 



3.4 Processing of P-Nodes 

Let fj, he a P-node of T. Let B = Prt(/r). Let vi, . . . ,Vk be the children of /r 
in T. For each i G {!,..., fc}, let be the virtual edge of Vi in Skl(^), and 
Bi = Prt(^'j). 

If there is neither ^-cyclic vertex nor /r-critical vertex in E, then the pro- 
cessing of ^ is finished by (I) checking if {s{B),t{B)} G E{G) and (2) if so, 
modifying B by permuting its child blocks so that edge {s{B),t{B)} forms one 
side of B. So, assume that there is a /r-cyclic or /r-critical vertex in E . Then, 
(G,E,S) is not satisfiable, if {s{B),t{B)} ^ E{G). So, further assume that 
{s{B),t{B)} G E{G). Since {s(i?), C for all ^-through vertices 

/ in E, every /r-cyclic or ^-critical vertex must appear on the path P from 
/o = x-thr(/r) to the root fy of the tree containing /o in E . If fy is a /r-critical 
vertex, we process /r by calling P-Procedurel below; otherwise, we process \i by 
calling P-Procedure2 below. 
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P-Procedurel: 

1. Let = {ci^, . . . ,€i^} where Vi^ is a Q-node. 

2. For all j G {2, . . . , h}, mark e^- if (1) some side of Bi does not consist of 

exactly two edges, or (2) S n - {s{B),t{B)}) ^ 0, or (3) V(Bi-) g 

Lr{fo)- 

3. Depending on the number m of marked edges, distinguish three cases: 

— Case 1: m = 0. Then, perform the following substeps: 

(a) Modify B by permuting its child blocks so that (1) the edge Bi^ is 
on the left side of B and (2) Bi ^ , . . . , Bi^ appear consecutively. 

(b) For each j G {2, . . . , h}, add an edge {s{B),t{B)} right after Bi. . 

— Case 2: m = 1. Let the marked edge be and perform the following: 

(a) If is an R-node, perform the following substeps: 

i. Check whether some side P' of Bi^ satisfies that (1) there are 
exactly two edges e{ and e '2 on P' and (2) the common endpoint 
of e']^ and e '2 is not in S but is in Ljr[f) where / = x-thr(^'i^,). 
(Comment: / is on P, i.e., / is an ancestor of /o in T .) 

ii. If P' does not exist, then stop immediately because {G,T,S) is 
not satisfiable; otherwise, flip Bi^. around its poles if P' is not 
the left side of Bi^. 

(b) If Vi^ is an S-node, then perform the following substeps: 

i. If Vi^ has more than two children, then stop immediately because 
(G, P, S') is not satisfiable. 

ii. Let the children of be xi and X 2 - For each j G {1,2}, if no 
side of Prt(xj) consists of a single edge, then stop immediately 
for failure; otherwise, flip Prt(xj) around its poles when the left 
side of Prt(xj) does not consist of a single edge. 

(c) Modify B by permuting its child blocks so that (1) the edge Bi^ is on 

the left side of B and (2) Pj^+i, . . .,Bi^,B^^ appear 

consecutively in this order. 

(d) For all j e {2, . . . , /i| — {£}, add an edge |s(P), t{B)} right after Bi. . 
— Case 3: m > 2. Then, stop because (G, P, S') is not satisfiable. 

P-Procedure2: 

1 . Perform Steps E and 0 of P-Procedurel . 

2. Depending on the number m of marked edges, distinguish three cases: 

“ Case 1: TO = 0. Same as Case 1 in P-Procedurel. 

— Case 2: TO = 1. Same as Case 2 in P-Procedurel. 

— Case 3: TO = 2. Let and be the two marked edges where ii < it,/- 

Then, perform the following substeps: 

(a) If Vi^ is an R-node, then perform Subst eps |3(a)i| and |3(a)ii| of P- 
Procedurel; otherwise, perform Substeps |3(b)i| and |3(b)ii| ofP-Pro- 
cedurel. 
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(b) If Vi^, is an R-node, perform the modification of Substeps |3(a)i| and 
|3(a)ii| of P-Procedurel obtained by (1) changing to it and (2) 
changing the word “left” to “right”; otherwise, perform the modi- 
fication of Substeps |3(b)i| and |3(b)ii| of P-Procedurel obtained by 
doing (1) and (2). 

(c) Modify B by permuting its child blocks so that Bi ^, , Bi ^ , . . . , 

. . . , B^^,+i, . . . ,B^^, Bi^ appear consecutively in this 

order. 

(d) For all j G {2, . . . , /i} — {£, £'}, add an edge {s{B), t{B)} right after 
B, . 

‘‘3 

— Case 4: m > 3. Then, stop because (G, IF, S) is not satisfiable. 

After /j, is processed without failure, the invariant in t |.S.2l ho]d for /r. 
Theorem 2. The overall algorithm can be implemented in 0(|G| log |G|) time. 

4 The Single-Tree Case 

Assume that T is, & tree and [S'! < 1. A candidate edge is an edge {z\,Z 2 \ of G 
such that (1) the lowest common ancestor of z\ and Z 2 in T is the root of T 
and (2) if \S\ = 1, then {zi,Z 2 } n S' yf 0. 

A plane graph Q witnesses (G, JF, S) if (1) Q is an edge-duplicated supergraph 
of G, (2) for every non-leaf and non-root vertex / of iF, Q[Lyr(^f)] is a superface 
of Q whose interior contains no vertex of S, (3) at most one face of is a fc-face 
with k > 4, and (4) if |S| = 1 and a fc-face F with k > 4 exists in Q, then the 
unique vertex of S appears on the boundary of F. 

Lemma 4. Let {zi,Z 2 } be a candidate edge ofG. Then, (G,F,S) is satisfiable 
if and only if there is a plane graph FI such that (1) H witnesses (G,F,S) and 
(2) both zi and Z 2 appear on the boundary of the outer face in H . 

Using Lemma 0 and modifying the algorithm in we can prove: 

Theorem 3. Given a graph-forest pair (G, F), it takes 0(|lF| -|- |G| log |G|) time 
to decide whether (G, F) is the abstract of a planar disc map. 
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Abstract. Given the trapezoid diagram, the problem of finding the 
minimum cardinality connected dominating set in trapezoid graphs was 
solved in 0{m + n) time jjj; the results is recently improved to be 0{n) 
time by Kohler |S]- For the (vertex) weighted case, finding the minimum 
weighted connected dominating set in trapezoid graphs can be solved 
in 0{m -b nlogn) time Here n (m) denotes the number of vertices 
(edges) of the trapezoid graph. 

In this paper, we show a different approach for finding the minimum 
cardinality connected dominating set in trapezoid graphs using 0(n) 
time. For finding the minimum weighted connected dominating set, we 
show the problem can be efficiently solved in 0(n log log n) time. 



1 Introduction 

The intersection graph of a collection of trapezoids with corner points lying on 
two parallel lines is called the trapezoid graph. Trapezoid graphs is first proposed 
by Dagan et al. |2| showing that the channel routing problem is equivalent to the 
coloring problems on trapezoid graphs. Throughout the paper, we use n (m) to 
represent the number of vertices (edges) in the given graph. Ma and Spinrad jlT!) 
show that trapezoid graphs can be recognized in O(n^) time. Note that trapezoid 
graphs are cocomparability graphs, and properly contain both interval graphs 
and permutation graphs (Q. Note that interval graphs are the intersection graph 
of intervals on a real line; permutation graphs are the intersection graph of line 
segments whose endpoints lying on two parallel lines. 

A dominating set of a graph G = (V, E) is a subset D oi V such that every 
vertex not in D is adjacent to at least one vertex in D. Each vertex v € V can 
be associated with a (non-negative) real weight, denoted by w{v). The weighted 
domination problem is to find a dominating set, D, such that its weight w{D) = 
Eugd is minimized. A dominating set D is independent, eonnected or total 
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if the subgraph induced by D has no edge, is connected, or has no isolated vertex, 
respectively. Dominating set problem and its variants have many applications 
in areas like bus routing, communication network, radio broadcast, code-word 
design, and social network 

The decision version of the weighted domination problem is NP-complete 
even for cocomparability graphs 0. For trapezoid graphs, Liang 0 shows that 
the minimum weighted domination and the total domination problem can be 
solved in 0{mn) time. Lin |S| show that the minimum weighted independent 
domination in trapezoid graphs can be found in 0(n log n) time. Srinivasan et al. 
P show that the minimum weighted connected domination problem in trape- 
zoid graphs can be solved in 0(m -I- nlogn) time. For the unweighted case, 
the 0(n log n) factor is improved by Liang 0, who show that the minimum- 
cardinality connected domination problem in trapezoid graphs can be solved in 
0{m + n) time. However, since the number of edges, m, can be as large as O(n^), 
the potential time-complexity of the algorithm is still 0{v?). In this paper, we 
show that the minimum cardinality connected domination problem in trapezoid 
graphs can be solved optimally in 0(n) time. Further, for the weighted case, we 
show the problem can be efficiently solved in 0(n log log n) time. 

This paper is organized as follows. Section establishes basic notations and 
some interesting properties of trapezoid graphs concerning the connected domi- 
nation problem. SectionElgives our 0(n)-time algorithm of finding the minimum- 
sized connected dominating set in trapezoid graphs. Section^shows that finding 
the minimum weighted connected domination can be done in O(nloglogn) time. 
Finally, we conclude our results in Section 0 



2 Basic Notations and Properties 

A graph G = (V,E) is a trapezoid graph if there is a trapezoid diagram T such 
that each vertex Vi in V corresponds to a trapezoid ti in T and {vi,Vj) € E 
if and only if ti and tj intersects in the trapezoid diagram. Here the trapezoid 
diagram consists of two parallel lines Li (the upper line) and L 2 (the lower line). 
Further, a trapezoid U is defined by four corner points ai,bi,Ci,di such that 
[ai-.bi] {[ci..di]) is the top (bottom) interval of trapezoid ti on Li (L 2 ); note that 
Oi < bi and Ci < di. 

It can be shown that any trapezoid diagram can be transformed into another 
trapezoid diagram with all corner points are distinct while the two diagrams still 
correspond to the same trapezoid graph. Thus, we assume that the two parallel 
lines, L\ and L 2 , are labeled with consecutive integer values 1, 2, 3, . . . , 2n from 
left to right; that is, U(L;^{ai,5i} = = [1..2n]. We assume that these 

trapezoids are labeled in increasing order of their corner points biS. We define 
two trapezoids U -< tj if bi < aj and di < Cj; that is, the top (bottom) interval 
of ti totally lies on the left of the top (bottom) interval of tj . Given a trapezoid 
diagram, the corresponded partial ordered set is a trapezoid order. Note that two 
trapezoids intersect each other (denoted by tk ~ t;) if and only if neither 
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tk -< ti nor ti -< tk', that is, they are incomparable in the trapezoid (partial) 
order. 

Throughout the paper, we use the words “trapezoid” and “vertex” inter- 
changeably for convenience. A vertex Vi (vj) is called a left (right) dominator ii 
and only if there are no vertices lie on the left (right) of Vi (r'j')in the trapezoid 
diagram. That is, (\/v € V){v Vi) and (Vf G y){vj ft v). Let be a minimum 
cardinality connected dominating set of the trapezoid graph G. Clearly, \D\ = 1 
if and only if there is a vertex v G G that is a left and right dominator at the 
same time. 

Two adjacent vertices u, v form a source pair {sink pair) if and only if there 
are no vertices lie on the left (right) of u and v at the same time. A vertex v is 
a source (sink) vertex if there is another vertex u such that {it, i;} form a source 
(sink) pair. 

Proposition 1. Given the trapezoid diagram, v is a source (sink) vertex if and 
only if there is a left (right) dominator d G V such that v d. It is easily seen 
that \D\ = 2 if and only if |I?| yf 1 and there is a pair (it, v} G G that is a source 
pair and a sink pair at the same time. For |I?| > 3, we have the following lemma 

0 for cocomparability graphs; it thus holds for trapezoid graphs as well. 

Lemma 1. For a connected cocomparability graph G, there exists a minimum 
cardinality connected dominating set that induces a simple path in G. Further- 
more, let P = (pi, . . . ,pk) be such a path with k > 3, then pi -< Pi +2 for 

1 < i < k — 2, and every vertex before min{pi,p 2 } is covered by {pi,P 2 } and 
every vertex a/ter max{pfe_i,pfe} is covered by {pk-i,Pk\ ■ 

Now consider the case that \D\ > 3. Within all left dominators, let b (d) be 
the vertex having the rightmost top (bottom) corner; it is possible that b, d are 
the same vertex. By Lemma [D and Proposition ^ it follows that the minimum 
cardinality connected dominating set for \D\ >3 can be found by finding a 
shortest-length path, from b or d, until one of the leftmost left dominators {a, c} 
is reached. That is, let Pba, Pbc, Pda, Pdc denote these four shortest paths. Among 
these four paths, the one with the smallest cardinality is the desired minimum 
cardinality connected dominating set. 



3 Minimum Cardinality Connected Domination 

Immediately following the previous discussions, the strategy of the algorithm 
can be described as following: for each vertex v, decide whether f is a left (right) 
dominator, and check for the special cases for 1 < |Z?| < 2. If it turns out that 
\D\ > 3, we can find the shortest-length path from the rightmost corners of left 
dominators until reaching the first right dominator. 

We first show that all the dominators can be found in 0{n) time. Consider 
the right (left) tangent edge of a trapezoid tj, the vertical line segment bidi 
{oiCi) connecting Li and L 2 , is called a right (left) stick, denoted by = bidi 
{li = OiCi)- Treating a stick as a degenerated case of trapezoid, we say sticks 
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Ti A Tj if and only if bi < bj and di < dj. A group of sticks, S, is called a pencil 
if every two sticks of S intersect each other. Consider the set of all right sticks 
S = {ri, r 2 , . . . , Vn}- A stick in S is eliminated if there is another stick lies on the 
left of it; the resulting sticks form a left-pencil, denoted by L = , . . . , ) . 

Note that bi^ < bi^ < ■ ■ ■ < bi,, , but di^ > di^ > ■ • • > di,, . It is readily seen that 
vertices {vi-^ jVi^, ■ ■ ■ ,Vi,.} are left dominators. The following discussions concern 
properties of the left-pencil; similar properties hold for the right-pencil as well 
by symmetry . 

Proposition 2. Let L be the left-pencil of the trapezoid diagram. A vertex v is 
not a left dominator if and only if (3r C L)(r -< v). Further, the left-pencil L 
can he found in 0{n) time. The left-pencil structure provides an efficient way 
of checking whether a given vertex is a left dominator. 

Lemma 2. All left (right) dominators of the trapezoid graph can be identified 
in 0{n) time. 

After obtaining all left dominators Vl and right dominators Vr in 0(n) time, 
we compute the rightmost top (bottom) corners among all left dominators; let 
b = max{6i | Vi € VL},d — maxjdi | Vi G Vl}. The computation of corners b 
and d clearly takes 0{n) time. Symmetrically, the leftmost top (bottom) corners 
among all right dominators, a = minjoi | Vi G Vr},c = min{ci | Vi G Vr} is 
computed in 0{n) time. For brevity, when the context is clear, we also use a,c 
(b,d) to denote the vertices Vi’s whose afs, cfs {bi’s, dfs) values are minimized 
(maximized) . 

Let D denote the minimum cardinality connected dominating set. Recall 
that, \D\ = 1 if and only if there is a vertex v being a left and right dominator 
at the same time; it can be easily checked within 0{n) time. Now consider the 
case that \D\ = 2. In which case, we say that D = {w,u} is a dominating pair. It 
is possible that both vertices of D are dominators, one left dominator and one 
right dominator. In which case, we must have a < 6 or c < d; it follows that 
D = {a,b} or {c,d}. However, there is another possibility for \D\ = 2: 

Proposition 3. Let D = {u, u} he a dominating pair of a trapezoid diagram T . 
If none of the left dominators of T intersects with any of the right dominators, 
i.e., b < a and d < c, then neither u nor v is a dominator. 

Now consider the left-pencil L = (r^^ ,ri^, . . . ,ri^) and the right-pencil R — 
By Propositional and El neither u nor v fully dominates L or 
R although {u, v} collectively dominate both L and R. Thus, we have two types 
of coverage of L and R in terms of the shapes of intersection of trapezoids u and 

V. 

Case 1: the X-shaped intersection. That is, we have u ~ , Dg} U 

{^ 3 h'+i , ■ ■ ■ , J and u ~ {r,, , . . . , } U {ij , , . . . , }. 

Case 2: the 8-shaped intersection. That is, we have u ^ • ,Dg} U 

{£ji , . . . , } and u ~ {ri, , . . . , n ^, } U , ■ • ■ , J- 

We can show that 

Lemma 3. The X-shaped (8-shaped) dominating pair of a trapezoid diagram 
can be found in 0{n) time. 
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Theorem 1. The minimum cardinality connected dominating set of a trapezoid 
diagram can be found in 0(n) time. 



Algorithm MCDS(T) 

Input: A trapezoid diagram T with n trapezoids. Each trapezoid Vi is defined 
by four corner points ai,bi,a,di, 1 < i < n. They are labeled with consecutive 
integer values 1, 2, 3, ... , 2n; that is, U"^i{ai, t>i} = U”^i{ci, di} = [1..2n]. 
Output: A minimum cardinality connected dominating set D. 

Step 1: Identify all left dominators Vl and right dominators Vr. 

Step 2: \D\ = 1 if Vl n Vr 0; output any vertex n G Vl n Vr as the singleton 
dominating set and stop. 

Step 3: Compute b = max{t>i | Vi € Vl};^ = max{di | Vi € Vl};o = min{ai | 
Vi e Vr};c = min{ci | Vi G Vr}. 

Step 4- (* Deal with \D\ = 2. *) Stop and output D = {a,b} if o < 6; stop 
output {c, d} if c < d. Further, stop and output D if there is a dominating 
pair with X-shaped or 8-shaped intersection. 

Step 5: (* Now, \D\ > 3. *) Let Pba, Pbc, Pda, Pdc be the four shortest paths 
from {6, d} to {a, c}. Among these four paths, output the one with the smallest 
cardinality. 

End of MCDS 



Fig. 1. Finding the minimum cardinality connected dominating set in trapezoid 
graphs. 



Proof. The algorithm is shown in Figured Note that Step 1 of the algorithm 
takes 0{n) time by Lemma 0 Clearly Step 2 and Step 3 can be done in 0{n) 
time. Step 4 is the most complicated case of the algorithm; it has been shown 
in Lemma d that the dominating set can be correctly found in 0{n) time. 

The rest of the proof appears in the complete paper. □ 



4 Minimum Weighted Connected Domination 

A minimal connected dominating set is a connected dominating set such that 
the removal of any vertex leaves the resulting subset being no longer a connected 
dominating set. It is easily seen that a minimum weighted connected dominating 
set is minimal since the assigned weights are non-negative. It can be shown m 
that a minimal connected dominating set of a trapezoid graph is consisted of 
three parts: S,P, and T; here S denotes the set of a dominating source (a left 
dominator or a source pair), T denotes the set of a dominating target (a right 
dominator or a sink pair), and P denotes a (lightest) chordless path from S to T. 
Note that the dominating source (target) can be a singleton or a pair of vertices. 
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We associate with each vertex v an aggregated weight, w'(v), to be the sum 
of weights of the minimum connected dominating vertices from the source set to 
V, containing the vertex v as the dominating set. In other words, 

w'(v) = min{^^ w(u) | P : a path from a left dominator (source pair) to u} 
ueP 

We first show that the source vertices can be initialized efficiently. The aggre- 
gated weight for a left dominator u is, by definition, w'{u) = w(u). For a non- 
dominator source vertex v, it joins with another vertex x to form a minimum 
weighted source pair containing v. It is possible that the joint partner x is a 
dominator; the aggregated weight of v can be thus deduced from the dominator. 
The general scheme of the algorithm will deal with the situation properly. Oth- 
erwise, X is another non-dominator but together {u, x} constitute a minimum 
weighted source pair. We call such kind of a source pair a non-dominator source 
pair. 

Lemma 4. The aggregated weights of all non-dominator source pairs of the 
trapezoid graph can be found in 0(ji) time. 

Proof. The proof appears in the complete paper. □ 

Let two adjacent vertices {u, u} form a sink pair. If neither u nor u is a right 
dominator, we call such kind of a sink pair a non-dominator sink pair. Note 
that the target vertices can either be a singleton right dominator or two vertices 
forming a non-dominator sink pair. For each non-dominator sink vertex v, there 
is a (possibly none existed) lightest non-dominator vertex x so that together 
{v,x} form a non-dominator sink pair. We assign such a non-dominator sink 
vertex v a modified weight of v with a value w{v) -\- w(x). We can show that 

Lemma 5. The modified weights of all non-dominator sink pairs of the trapezoid 
graph can be found in 0{n) time. 

Proof. The proof appears in the complete paper. □ 

We now present our method of calculating the aggregated weights of every 
vertex. The idea is to set up two priority queues, say Qi and Q 2 , along with 
the two parallel lines, Li and L 2 , on the trapezoid diagram. The (so far calcu- 
lated) aggregated weights of the vertices are stored upon the priority queues. 
The algorithm will consider both of the priority queues and sweep along left (top 
or bottom) corners of trapezoids in a zigzag fashion. Note that we record the 
aggregated weights upon the positions of right corners of the trapezoids consid- 
ered so far; the aggregated weights are kept according to the increasing order 
along the parallel lines. Since the input diagram of the trapezoid graph is given 
in a finite range of integers, we can use the data structure of van Emde Boasf2| 
to maintain a priority queue such that each operation of inserting, searching, 
and deleting an element in the priority queue can be done in O(loglogn) time. 
Thus, by carefully maintaining the priority queues and balancing the order of 
the visited vertices, we can show that 
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Lemma 6. The aggregated weights of all vertices of the trapezoid graph can be 
found in 0 (n log log n) time. 

Proof. We begin by inserting the aggregated weights of all the left dominators 
and non-dominator source pairs into the priority queues Qi and Q2. Note that 
the top right corners of source vertices are inserted into Qi, and the bottom 
right corners are inserted into Q2. The aggregated weights are kept on both 
priority in increasing (weighted) order. Let the currently maintained list on Qi = 
It follows that bi^ < < ■ ■ ■ < bi^ and < w'iv^^) < 

• •■ < w'{vif.). Symmetrically, let Q2 = we have dj^ < dj^ < 

• • ■ < dj^ and w'{vj^) < w'{vj2) < ■ ■ • < w'{vj^). In particular, assume that a 
vertex v with position b(v) G [vig..Vi^^f\ is inserted into Q\. In the following, we 
describe how to join a vertex v into Qi: The vertex v is discarded if w'{v) > 
w'{vig+i)- Otherwise, we insert the vertex v into Qi and remove all vertices lied 
on the left of b{v) with w'{-) larger than w'{v). The operation of joining d{v)’s 
into Q2 follows symmetrically. 

After the source vertices have been properly joined into Qi and Q2, let 
Qi = {vii,Vi2, . ■ ■) and Q2 = . . .). The algorithm starts by checking 

the smaller value of u>'(ujj}. Assume that w'{vi^) < it follows 

that vertices vfs with their top left corners satisfying at < bi^ now have their 
aggregated values decided. That is, w'{vi) = w{vi) + w'{vi^). The situation is 
illustrated in Figure 0 Note that the top right corner of the vertex Vi will then 



a(v) ''b{x) 





cfvj ^d{x) 



y ! w'{v) = \v{v) + w'{x) 

Qi 



Fig. 2. Finding the aggregated weights of all vertices. 



be joined into Qi, and the bottom right corner of the vertex Vi will be joined 
into Q2. We then discard the vertex Vi^ from Q\. The case that w'{vjf) < w'(viy) 
can be handle symmetrically. 

It is readily verified that the algorithm described above gives the correct ag- 
gregated weights of all vertices, and the total efforts spent is at most 0 {n log log n) 
time. The rest of the proof appears in the complete paper. □ 

Let two adjacent vertices {u,u} form a dominating pair of the trapezoid 
graph. There are three kinds of dominating pairs. The first case of a dominating 
pair is when one of the vertices is a left dominator and the other vertex is a right 
dominator. It is easily seen that the general scheme of the algorithm deal with 
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the situation correctly. The second case of a dominating pair is when one of the 
vertices is a dominator, but the other vertex is not a dominator. Here we show 
how to find the minimum weighted connected dominating pair of such kind. 

Lemma 7. The minimum weighted connected dominating pair, forming by a 
dominator and a non- dominator, can be found in 0(n) time given the trapezoid 
diagram. 

Proof. The proof appears in the complete paper. □ 

The third case of a dominating pair is when none of the vertices is a dominator. 
That is, we have a X-shaped or a 8-shaped dominating pair. The case considered 
here is actually the most complicated situation of the algorithm. We can also 
show that 

Lemma 8. The minimum weighted connected, X-shaped or 8-shaped, dominat- 
ing pair of a trapezoid diagram can be found in O(nloglogn) time. 

Proof. First we show that the minimum weighted dominating pair with X- 
shaped intersection can be found in 0(n log log n) time. Let the left-pencil L — 
(j'ii ) 5 • • • 5 Dg ) and the right-pencil R = , . . . , ) . Further, let the min- 

imum weighted dominating pair be the set {u, a;}. Without loss of generality we 
may assume that vertex v dominates the lower part of L and the upper part of 
R, while X dominates the upper part of L and the lower part of R. The idea here 
is that, while the bottom left corner of v located at interval [di,^^„..di^.], the top 
left corner of x must be located before the position 6^^; i.e., a(x) G So, 

by pairing these {u,a;}’s pair and checking their domination of the right-pencil, 
we can find the minimum, X-shaped, dominating pair. 

Maintain two priority queues: Qi for the upper intervals of right-pencil and 
Q2 for the lower intervals of the right-pencil. Note that Qi is initialized by insert- 
ing all upper positions of the right-pencil R. Scan each vertex x within [6^^ ..bi^], 
the first upper interval of L. Recall the join operation we defined in Lemma El 
We will join the bottom right corners of x (as well as their corresponded weights 
w(x)’s) into the queue Q2. Then, for each vertex v within the last lower interval 
of L, [dij ..dif\, we probe the top right corners of v to seek its position in Qi, say 
b{v) G [o-jk-’^jk+i]- It follows that those x’s whose bottom right corners greater 
than Cjk.^1 can be paired with v to become a dominating pair. To get the lightest 
joint vertex, we can probe into Q2, seeking to the right, and obtain the de- 
sired minimum joint partner x. We maintain the sum of weights of the lightest 
dominating pair obtained so far; each time a joint partner is found, the sum of 
their weights is compared with it. The minimum weighted dominating pair can 
thus be found by this manner. The situation is illustrated in Figure El 

We obtain the final minimum weighted dominating pair by successively join- 
ing the vertices cornered at [6^2 •■^13] into Q2 and probing vertices cornered at 
[di2..di,j] into Qi, and so on, until the final intervals and [dig_i..dig] 

have been reached. Note that positions d(a:)’s are accumulatively joined (with 
side effects) into Q2 while positions b{v)’s are probed into Qi interleavedly with 
joining of d{x)’s. Since each operation to the priority queue takes O(loglogn) 
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Fig. 3. Finding a X-shaped (left), or a 8-shaped (right) minimum weighted 
dominating pair. 



time, and we spend only a constant number of queue operations for each vertex, 
it is not hard to see that the total efforts spent is at most 0(n log log n) time. 

The case of 8-shaped dominating pair is similar to the case discussed above. 
The rest of proof appears in the complete paper. □ 

We are now ready to show that 

Theorem 2. The minimum weighted connected dominating set of a trapezoid 
diagram can he found in O(nloglogn) time. 



Algorithm WCDS(T) 

Input: A trapezoid diagram T with n trapezoids. 

Output: A minimum weighted connected dominating set D. 

Step 1: Find the left-pencil (right-pencil) and all left (right) dominators. 

Step 2: Initialize the aggregated weights of left-dominators and non-dominator 
source pair. 

Step 3: Find the non-dominator sink pairs, set their modified weights, and mark 
these right dominators and non-dominator sink pairs as the target vertices. 
Step 4 - Calculate the aggregated weights w'{-) of all the vertices along the 
two parallel lines. Scan through each right dominator and the modified sink 
pairs, and find the one with the minimum aggregated weight. Let the resulting 
aggregated weight of the dominating set be Wi. 

Step 5: Compute the minimum dominating pair, forming by a dominator and 
a non-dominator. Let the resulting weight of the pair be W 2 - 
Step 6 : Compute the minimum dominating pair with X-shaped or 8-shaped 
intersection. Let the resulting weight of the minimum dominating pair be W 3 . 
Step 1: The smallest value within the set {wi, W 2 ,W 3 } is the desired weight of 
the minimum weighted connected dominating set. Output the corresponded 
dominating set D by tracing back the predecessor links. 

End of WCDS 



Fig. 4. Finding the minimum weighted connected dominating set in trapezoid 
graphs. 
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Proof. The algorithm is shown in FigureEl Note that Step 1 to Step 3 takes 0{n) 
time by Lemma HI andO Step 4, by Lemma El can be done in 0(n log log n) 
time. Step 5 can be done within 0(n) time by Lemma 0 Step 6 is the most 
complicated case of the algorithm; Lemma 0 shows that it takes 0(n log log n) 
time. The rest of the proof appears in the complete paper. □ 



5 Concluding Remarks 

Note that previous results pm] solved the problems by checking all possible 
source and sink pairs; thus they had to spend 0{m + n) time for the unweighted 
case and 0{m + nlogn) time for the weighted case even when the trapezoid 
diagram was given. Note that the size of the trapezoid diagram is just 0{n), 
but these algorithms still spend potentially time when the size of edges 

is large. 

Recently, Kohler 0 presents an 0{n) time algorithm for finding the mini- 
mum cardinality connected dominating set in trapezoid graphs. Using different 
approach, our algorithm exploits the pencil structure within the trapezoid di- 
agram to find the minimum-sized connected dominating set in 0{n) time. For 
the weighted problem, our algorithm again uses the pencil structure as well as 
priority queues of finite integers m to find the minimum weighted connected 
dominating set in 0(n log log n) time. 
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Abstract. We consider the parameterized complexity of the following 
problem under the framework introduced by Downey and Fellows|^: 
Given a graph G, an integer parameter k and a non-trivial hereditary 
property 77, are there k vertices of G that induce a subgraph with prop- 
erty 77? This problem has been proved NP-hard by Lewis and Yanna- 
kakis0. We show that if 77 includes all independent sets but not all 
cliques or vice versa, then the problem is hard for the parameterized 
class W[l] and is fixed parameter tractable otherwise. In the former 
case, if the forbidden set of the property is finite, we show, in fact, that 
the problem is IU[l]-complete (see |2] for definitions). Our proofs, both of 
the tractability as well as the hardness ones, involve clever use of Ramsey 
numbers. 



1 Introduction 

Many computational problems typically involve two parameters n and k, e.g. 
finding a vertex cover or a clique of size fc in a graph G on n vertices. The 
parameter k contributes to the complexity of the problem in two qualitatively 
different ways. The parameterized versions of Vertex Cover and Undirected 
Feedback Vertex Set problems can be solved in 0(/(A:)n“) time where n is 
the input size, a is a constant independent of k and / is an arbitrary function 
of k (against a naive 0{rf^) algorithm for some constant c). This “good behav- 
ior”, which is extremely useful in practice for small values of k, is termed fixed 
parameter tractability in the theory introduced by Downey and Fellows |‘2III4j . 

On the other hand, for problems like Clique and Dominating Set, the 
best known algorithms for the parameterized versions have complexity 0{rf^) 
for some constant c. These problems are known to be hard for the parameterized 
complexity classes W[l] and VF[2] respectively and are considered unlikely to 
be fixed parameter tractable (denoted by FPT) (see 0 for the definitions and 
more on the parameterized complexity theory). In this paper, we investigate 
the parameterized complexity of finding induced subgraphs of any non-trivial 
hereditary property in a given graph. 

* Part of this work was done while the first author was at IIT Bombay and visited the 
Institute of Mathematical Sciences, Chennai 

D.-Z. Du et al. (Eds.): COCOON 2000, LNCS 1858, pp. 1.87- TT71 2000. 
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A graph property 77 is a collection of graphs. A graph property 77 is non- 
trivial if it holds for at least one graph and does not include all graphs. A 
non-trivial graph property is said to be hereditary if a graph G is in property 77 
implies that every induced subgraph of G is also in 77. A graph property is said 
to be interesting if the property is true (as well as false) for infinite families of 
graphs. Lewis and Yannakakisj^ (see also 0) showed that if 77 is a non-trivial 
and interesting hereditary property, then it is NP-hard to decide whether in a 
given graph, k vertices can be deleted to obtain a graph which satisfies 77. 

For a hereditary property 77, let T be the family of graphs not having the 
property. The set of minimal members (minimal with respect to the operation 
of taking induced subgraphs) of T is called the forbidden set for the property 
77. For example, the collection of all bipartite graphs is a hereditary property 
whose forbidden set consists of all odd cycles. Conversely, given any family T of 
graphs, we can define a hereditary property by declaring its forbidden set to be 
the set of all minimal members of T . 

CaiPj has shown that the graph modification problem for a non-trivial hered- 
itary property 77 with a finite forbidden set is fixed parameter tractable (FPT). 
This problem includes the node deletion problem addressed by Lewis and Yan- 
nakakis (mentioned above) . While the parameterized complexity of the question, 
when 77 is a hereditary property with an infinite forbidden set, is open, we ad- 
dress the parametric dual problem in this paper. Given any property 77, let 
P(G, k, 77) be the problem defined below. 

Given: A simple undirected graph G{V,E) 

Parameter: An integer k<\V\ 

Question: Is there a subset V' C V with \V'\ = k such that the subgraph 
of G induced by V has property 77 ? 

This problem is the same as ‘|P| — k’ node deletion problem (i.e. can we 
remove all but k vertices of G to get a graph with property 77) and hence NP- 
hard. However the parameterized complexity of this problem doesn’t follow from 
Cai’s result. We prove that if 77 includes all independent sets, but not all cliques 
or vice versa, then the problem P{G, k, 77) is VF[l]-complete when the forbidden 
set of 77 is finite and VF[l]-hard when the forbidden set is infinite. The proof is 
by a parametric reduction from the Independent Set problem. If 77 includes 
all independent sets and all cliques, or excludes some independent sets and some 
cliques then we show that the problem is fixed parameter tractable. 

Note, from our and Cai’s result, that the parameterized dual problems dealt 
with, have complimentary parameterized complexity. This phenomenon has been 
observed in a few other parameterized problems as well. In a graph G(V,E), 
finding a vertex cover of size k is FPT whereas finding an independent set of size 
k (or a vertex cover of size \V\ — k) is kF[I]-complete; In a given boolean 3-CNF 
formula with m clauses, finding an assignment to the boolean variables that 
satisfies at least k clauses is FPT whereas finding an assignment that satisfies 
at least (m — k) clauses (i.e. all but at most k clauses) is known to be IF[P]- 
hard H {k is the parameter in both these problems). The fc-iRREDUNDANT Set 
problem is kF[l]-hard whereas Co-Irredundant set or (n — k) Irredundant 
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Set problem is FPT jSj. Our result adds one other (general) problem to this 
list. 

Throughout the paper, by a graph we mean an undirected graph with no 
loops or multiple edges. By a non-trivial graph, we mean a graph with at least 
one edge. Given a graph G and A C V(G), by Ig{A) we mean the subgraph 
of G induced by vertices in A. For two graphs H and G, we use the notation 
H C G to mean that H is isomorphic to an induced subgraph of G. For the 
graph properties U we will be concerned with in this paper, we assume that II 
is recursive; i.e. given a graph G on n vertices, one can decide whether or not G 
has property 77 in /(n) time for some function of n. 

We had already defined the notion of fixed parameter tractable problems. To 
understand the hardness result, we give below some definitions. See ^ for more 
details. 

A parameterized language L is a subset of A* x TV where S is some finite 
alphabet and N is the set of all natural numbers. For {x, k) G L, k is the 
parameter. We say that a parameterized problem A reduces to a parameterized 
problem B, if there is an algorithm I> which transforms (x,k) into (x\g{k)) in 
time f{k)\x\°‘ where f,g : N —^N are arbitrary functions and a is a constant 
independent of fc, so that (x,k) S A if and only if {x',g{k)) G B. The essential 
property of parametric reductions is that if A reduces to B and if B is FPT, 
then so is A. 

Let 7^ be a family of boolean circuits with and, or and not gates; We allow that 
F may have many different circuits with a given number of inputs. Let the weight 
of a boolean vector be the number of I’s in the vector. To F we associate the pa- 
rameterized circuit problem Lp = {(G, k) : C accepts an input vector of weight 
7}. Let the weft of a circuit be the maximum number of gates with fan-in more 
than two, on an input-output path in the circuit. 

A parameterized problem L belongs to W\t] if L reduces to the parameterized 
circuit problem Lp(^f^ p) for the family F(t,h) of boolean circuits with the weft 
of the circuits in the family bounded by t, and the depth of the circuits in the 
family bounded by a constant h. This naturally leads to a completeness program 
based on a hierarchy of parameterized problem classes: 



FPT C VF[1] C W[2] C ... 



The parameterized analog of NP is VF[1], and LF[l]-hardness is the basic 
evidence that a parameterized problem is unlikely to be FPT. 

The next section deals with the hereditary properties for which the problem 
is fixed parameter tractable, and Section 3 proves the VF[l]-hardness result for 
the remaining hereditary properties. Section 4 concludes with some remarks and 
open problems. 
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2 Hereditary Properties That Are FPT to Find 

Lemma 1. If a hereditary property II includes all independent sets and all 
cliques, or excludes some independent sets as well as some cliques, then the 
problem P{G, k, II) is fixed parameter tractable. 

Proof. For any positive integers p and q, there exists a minimum number R{p, q) 
(the Ramsey number) such that any graph on at least R(j>, q) vertices contains 
either a clique of size p or an independent set of size q. It is well-known that 
R{p,i) < 

Assume that II includes all cliques and independent sets. For any graph G 
with |F(G)| > R{k, k), G contains either a clique of size k or an independent set 
of size k. Since all independent sets and all cliques have property U , the answer 
to the problem P{G, k, U) in this case is ”yes”. 

When \V{G) \ < R{k, k), we can use brute force by picking all fc-elements sub- 
sets of V (G) and checking whether the induced subgraph on the subset has prop- 
erty n. This will take time f (k) where f{k) is the time to decide whether 

a given graph on k vertices has property II. Thus the problem P{G, k, II) is fixed 
parameter tractable. 

If n excludes some cliques and some independent sets, let s and t respectively 
be the sizes of the smallest clique and independent set which do not have property 
77. Since any graph with at least R{s,t) vertices has either a clique of size s or 
an independent set of size t, no graph with at least R{s,f) vertices can have 
property 77 (since 77 is hereditary). Hence any graph in 77 has at most R{s,t) 
vertices and hence 77 contains only finitely many graphs. So if 7 > R{s,t), then 
the answer to the P{G, k, 77) problem is NO for any graph G. If fc < R{s, t), then 
check, for each k subset of the given vertex set, whether the induced subgraph 
on the subset has property 77. This will take time (^)/(7) < Gn^*^®7) for an n 
vertex graph, where G is time taken to check whether a graph of size at most 
R{s,t) has property 77. Since s and t depend only on the property 77, and not 
on k or n, and k < R{s,t), the problem P{G, k, 77) is fixed parameter tractable 
in this case also. □ 

We list below a number of hereditary properties 77 (dealt with in CD) each of 
which includes all independent sets and cliques, and hence for which the problem 
P{G, k, 77) is fixed parameter tractable. 

Corollary 1. Given any simple undirected graph G, and an integer k, it is fixed 
parameter tractable to decide whether there is a set ofk vertices in G that induces 
(a) a perfect graph, (b) an interval graph (c) a chordal graph, (d) a split graph, 
(e) an asteroidal triple free (AT-free) graph, (f) a comparability graph, or (g) a 
permutation graph. (See m or m for the definitions of these graphs.) 

3 Hereditary Properties That Are W-Hard to Find 

In this section, we show that the problem P(G, k, 77) is VF[l]-hard if 77 includes 
all independent sets but not all cliques or vice versa. 
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For a graph G, let G denote the edge complement of G. For a property 7T, 
let 7T = {G I G has property iT}. We note that II is hereditary if and only if II 
is hereditary, and II includes all independent sets but not all cliques if and only 
if n includes all cliques but not all independent sets. Thus it suffices to prove 
FF[l]-hardness when II includes all independent sets, but not all cliques. 

First we will show that, the problem is in W[l] if the forbidden set for 77 is 
finite. 

Lemma 2. Let II he a non-trivial hereditary property having a finite forbidden 
set T ={77i, 772, • ■ ■ , Hs}- Then the problem P{G, k, 77) is in VF[1]. 

Proof. Let j/ = max{\Hi\). Let Ai, . . . ,Aq be all the subsets of V{G) such that 
for every 1 < j < q, Ic{Aj) is isomorphic to some Hi. The sets Tlj’s can be 
determined in 0{f{v)n^) time by trying every subset of V{G) of size at most v. 
Here / is some function of v alone. 

Consider the boolean formula 

A|=i(VuGAj 

If this formula has a satisfying assignment with weight (the number of true 
variables in the assignment) fc, then the subset X of V{G) defined hy X = {u G 
V\xu = 1} is a subset of 14(G) of cardinality k such that Aj % X for any j. This 
implies that G has an induced subgraph of size k with property 77. 

Conversely if G has an induced subgraph of size k with property 77, then set- 
ting Xu = ^ for vertices u in the induced subgraph gives a satisfying assignment 
with weight k. Since q < and \Ai\ <v {a, constant) for all 7, it follows from 
the definitions that the problem is in W[l]. □ 

We now show that the problem is FF[l]-hard when one of the graphs in the 
forbidden set of 77 is a complete bipartite graph. 

Lemma 3. Let H he a hereditary property that includes all independent sets but 
not all cliques, having a finite forbidden set T ={77i, 772, • ■ • , Hs}. Assume that 
some Hi, say H\ is a complete bipartite graph. Then the problem P{G,k,H) is 
W[V\- complete. 

Proof. In Lemma El we have shown that the problem is in W[l]. 

Let 77 be as specified in the Lemma. Let t = max{\V\\, IV 2 I) where Vl IJ V 2 
is bipartition of 77i. If t = 1, 77i = K 2 , and the given problem P is identical 
to the 7-independent set problem, hence IF[l]-hard. So assume t > 2. Note that 
Hi C Kt^t- Let Hs be the clique of smallest size that is not in 77, hence in the 
forbidden set T. 

Now we will show that the problem is VF[l]-hard by a reduction from the 
Independent Set Problem. Let G\ be a graph in which we are interested in finding 
an independent set of size k\. For every vertex u G Gi we take r independent 
vertices ( r to be specified later) u^, . . . , u’’ in G. If {u,v) is an edge in Gi, we 
add all edges (u*, A) in G. G has no other edges. 
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We claim that that G\ has an independent set of size ki if and only if G has 
rki vertices that induce a subgraph with property U . 

Suppose G\ has an independent set {ui\l < i < ki} of size ki. Then the set 
of rki vertices {u^ | 1 < i < fci, 1 < j < r} is an independent set in G and hence 
has property 77. 

Conversely let S' be a set of rki vertices in G which induces a subgraph with 
property 77. This means 7 g(S) does not contain any Hi, in particular it does 
not contain H\. Group the rk\ vertices according to whether they correspond 
to the same vertex in G\ or not. Let Xi , . . . , Xh,Y\, . . . ,Yp be the groups and 
ui, . . . ,Uh,v\, . . . ,Vp be the corresponding vertices in G\ such that \Xi\ > 7 V i 
and \Yj\ < 7 V j. Observe that {ui , . . . , Uh} must be independent in Gi because 
if we have an edge (ui,Uj), Hi C Kt^t C Ic{Xi,Xj) C Ia{S), a contradiction. 
If 7 > fci we have found an independent set of size at least ki in Gi. There- 
fore assume that h < ki — 1. Then \^i\ — ~ 1) which implies that 

1^1 — or p > r/(t — 1). Since vertices in distinct groups (one vertex per 
group) in G and the corresponding vertices in Gi induce isomorphic subgraphs, 
the vertices vi, . . . ,Vp induce a subgraph of Gi with property 77 (since 77 is 
hereditary). Since this subgraph has property 77, it does not contain 77g as an 
induced subgraph. We choose r large enough so that any graph on r/{t — 1) 
vertices that does not contain a clique of size |77s| has an independent set of size 
ki . With this choice of r, it follows that Gi does contain an independent set of 
size ki- The number r depends only on |77s| and the parameter ki and not on 
ni = |y(Gi)|. So the reduction is achieved in 0(/(fci)m“) time where / is some 
function of ki and a is some fixed constant independent of 7i. □ 

Next, we will show that the problem is W[l]-hard even if none of the graphs 
in the forbidden set is complete-bipartite. 

Theorem 1. Let H he a hereditary property that includes all independent sets 
but not all cliques, hauing a finite forbidden set T ={77i, H^, . . . , 77s}. Then the 
problem P{G,k,H) is W[V\- complete. 

Proof. The fact that the problem is in W[l] has already been proved in Lemma 
121 Assume that none of the graphs Hi in the forbidden set of 77 is complete- 
bipartite. Let 77s be the clique of smallest size that is not in 77, hence in the 
forbidden set T. 

For a graph Hi in P, select (if possible) a subset of vertices Z such that the 
vertices in Z are independent and every vertex in Z is connected to every vertex 
in Hi\Z. Let {77.^ |1 < j < Sij be the set of graphs obtained from Hi by removing 
such a set Z for every possible choice of Z. Since Hi is not complete-bipartite, 
every Hij is a non-trivial graph. Let Pi= F \J{Hij\l < i < s,l < j < s^j. Note 
that Pi contains a clique of size |77s| — 1 because a set Z, consisting of a single 
vertex, can be removed from the clique 77s . Let Hi be the hereditary property 
defined by the forbidden set Pi. Observe that Hi also includes all independent 
sets but not all cliques. Let Pi be the problem P{Gi,ki,Hi). 

We will prove that Pi is VF[l]-hard later. Now, we will reduce Pi to the 
problem P{G, k, 77) at hand. 
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Given Gi, we construct a graph G as follows. Let V{G) = V(G\) IJ D where 
H is a set of r independent vertices (r to be specified later). Every vertex in 
V{G\) is connected to every vertex in D. Let v = maXi{\Hi\). 

We claim that Gi has an induced subgraph of size k\ with property Ui if 
and only if G has fci + r vertices that induce a subgraph with property 77. 

Let A be a subset of E(Gi), |A| = k\ such that Iq-^^{A) g 7Ti. Let S = A[J D. 
Suppose on the contrary that Ig{S) contains some Hi as a subgraph. If this Hi 
contains some vertices from D, we throw away these independent vertices. The 
remaining portion of Hi, which is some Hij, 1 < j < Si, must lie in Ig(A). But 
this is a contradiction because Ia{A) = Ig^(A) and by hypothesis, Ig^{A) has 
property H\ and it cannot contain any Hij. Similarly Hi cannot lie entirely in 
Ig{A) because IF C so Ig{A) does not contain any Hi as induced subgraph. 
Therefore Ig{S) does not contain any Hi, hence it has property 77 and IS”! = 
ki + r. 

Conversely, suppose we can choose a set S, |5| = fci + r such that Ig{S) 
does not contain any Hi. Since |77| = r we must choose at least ki vertices from 
V{Gi). Let A C ^(^^(Gi) with cardinality k\. If Ig{A) does not contain any 
Hij, we are through. Otherwise let Hi^j^ C Ig(A) for some io, Jq. Now Hi^jg 
is obtained from Hig by deleting an independent set of size at most ly. Hence 
S can contain at most v — 1 vertices from D, otherwise we could add sufficient 
number of vertices from D to the graph Hi^jg to get a copy of Hig which is not 
possible. Hence I^Pl -D| < v which implies that [S' p| I/(Gi)| > k\+r — v. Thus 
7 g^(S'P| E(Gi)) is an induced subgraph of Gi of size at least k\ + r — v that 
does not contain any Hi, in particular it does not contain 77^ which is a clique 
of size say /r. We can select r (as before, by Ramsey Theorem ) such that any 
graph on 7i + r — vertices that does not contain a /r-clique has an independent 
set of size k\. Hence G\ has an independent set of size k\ which has property 
H\. The number r depends only on the family T and parameter k\ and not on 
rii = |R(Gi)|. So the reduction is achieved in 0{g{ki)ni^) time where g is some 
function of fci and /3 is a constant. Also |R(G)| = |R(Gi)| + r, so the size of the 
input problem increases only by a constant. 

We will be through provided the problem Pi is W[l]— hard. If any of the Hij 
is complete-bipartite, then it follows from Lemma 0 Otherwise, we repeatedly 
apply the construction, given at the beginning of the proof, of removing set 
Z of vertices from each graph in the forbidden set, to get families ^ 3 , ■ ■ ■ 
and corresponding problems P 2 , P 3 , . . . such that there is a parametric reduction 
from Pm+i to to Pm. Since Pm+i contains a smaller clique than a clique in 
Pm, eventually some family Pmo contains a clique of size 2 (the graph K 2 ) 
or a complete-bipartite graph. In the former case, the problem Pmg is same as 
parameterized independent set problem, so W[l]-hard. In the latter case Pmo is 
W[l]-hard by Lemma El □ 

We can extend Theorem d to the case when the forbidden set is infinite. 
However we could prove the problem only W[l]-hard; we don’t know the precise 
class in the W-hierarchy the problem belongs to. 
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Theorem 2. Let II be a hereditary property that includes all independent sets 
but not all cliques (or vice versa). Then the problem P{G,k,II) is W\l]-hard. 

Proof. Every hereditary property is defined by a (possibly infinite) forbidden 
set jO] and so let the forbidden family for II he T = The proof 

is almost the same as in Theorem DJ Note that Lemma 0 does not depend on 
finiteness of the forbidden family. Also the only point where the finiteness of T 
is used in Theorem 0 is in the argument that if Ig(^) does contain some i?ioio 
then S can contain at most v — 1 vertices from the set D. This argument can 
be modified as follows. Since Ig{^) contains some \V{Higjg)\ < |A| = ki. 

Also Higjg is obtained from some Hi by removing an independent set adjacent 
to all other vertices of Hi. (If there are more than one such HiS from which 
is obtained, we choose an arbitrary Hi.) Let vi = max {\V{Hi)\ — \V{Hij)\) 
where the maximum is taken over all Hij such that \V{Hij)\ < k\. Hence if 
Ig{A) does contain some Higjg, we can add at most vi vertices from D to get 
Hi^. So S must contain less than vertices from D. The choice of r will have 
to be modified accordingly. □ 

Corollary 0 follows from Theorem 0 since the collection of forests is a hered- 
itary property with the forbidden set as the set of all cycles. This collection 
includes all independent sets and does not include any clique of size > 3. 

Corollary 2. The following problem is W[l]-hard: 

Given {G,k), does G have k vertices that induce a forest? 

This problem is the parametric dual of the Undirected Feedback Vertex 
Set problem which is known to be fixed parameter tractable Pj. 

Corollary 3. Following problem is W[l]-complete: 

Given (G,k), does there exist an induced subgraph of G with k vertices that is 
bipartite ? 

Proof. Hardness follows from Theorem El since all independent sets are bipartite 
and no clique of size at least 3 is bipartite. 

To show that the problem is in 1U[1], given the graph G, consider the boolean 
formula 



A«gK(G)(^“ ^ A(u,u)g£;(G)((®“ ^ *«) AiVu Vv)) 

We claim that G has an induced bipartite subgraph of size k if and only if 
the above formula has a satisfying assignment with weight k. Suppose G has an 
induced bipartite subgraph with k vertices with partition Vi and V 2 . Now for each 
vertex in Vi assign Xu = l,yu = 0, for each vertex in V 2 assign = 0,y„ = 1 
and assign Xu = yu — 0 for the remaining vertices. It is easy to see that this 
assignment is a weight k satisfying assignment for the above formula. 

Conversely, if the above formula has a weight k satisfying assignment, the 
vertices u such that a;„ = 1, = 0 or a;„ = 0, ?/„ = 1 induce a bipartite subgraph 

of G with k vertices. 

The corollary follows as the above formula can be simulated by a lU[l]-circuit 
(i.e. a circuit with bounded depth, and weft 1). □ 
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Corollary El can be proved along similar lines of Corollary 01 

Corollary 4. Following problem is W[l]- complete: Given (G, k) and a constant 
I, does there exist an l-colorable induced subgraph of size k? 

Finally we address the parametric dual of the problem addressed in Corollary 
01 Given a graph G, and an integer k, are there k vertices in G whose removal 
makes the graph bipartite? We will call this problem ‘n — k bipartite’. 

The precise parameterized complexity of this problem is unknown since 
though bipartiteness is a hereditary property, it has an infinite forbidden set, 
and so the problem is not covered by Cai’s result P . 

The ‘edge’ counterpart of the problem, given a graph G with m edges, and 
an integer k, are there k edges whose removal makes the graph bipartite, is the 
same as asking for a cut in the graph of size m — fc. It is known [TT!j that there 
exists a parameterized reduction from this problem to the following problem, 
which we call ‘all but k 2-SAT’. 

Given: A Boolean 2 CNF formula F 
Parameter: An integer k 

Question: Is there an assignment to the variables of F that satisfies all but 
at most k clauses of FI 

We show that there is also a parameterized reduction from the ^n—k bipartite 
problem’ to the ‘all but k 2-SAT’ problem. 

Theorem 3. There is a parameterized reduction from the ‘n — k bipartite prob- 
lem’ to the ‘all but k 2-SAT’ problem. 

Proof. Given a graph G, for every vertex, we set two variables (a;„, y„) and con- 
struct clauses in the same manner as in the proof of Corollary 0 The clauses are 
as follows: 

Set 1: 



Xu^ Vu y ueV{G) 

y{u,v)GE{G) 

Each clause in Set 1 is repeated fc -I- 1 times. 

Set 2 : Xu y Vu V m G ^(G). 

We show that it is possible to remove fc vertices to make the given graph bipartite 
if and only if there is an assignment to the variables in the above formula that 
makes all but at most fc clauses true. 

If there is an assignment that makes all but at most fc clauses true, then 
the clauses in Set 1 must be true because each of them occurs fc -|- 1 times. 
This ensures that the variables Xu,yu corresponding to the vertices are assigned 
respectively 0,0 or 0,1 or 1,0 and each edge e = (m, v) has a;„ = a;„ = ?/„ = ?/„ = 0 
or a;„ = 0,?/„ = 1 and a;„ = 1, ?/„ = 0 or vice versa. The vertices s for which 
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Xs = Us = 0 are removed to get a bipartite graph. At most k clauses in Set 2 are 
false. This ensures that at most k vertices are removed. 

Conversely if there exist k vertices whose removal results in a bipartite graph 
with partition Vi IJ V 2 , consider the assignment corresponding to each vertex u 
in the graph, a;„ = ?/„ = 0 if the vertex u is removed, = 1, ?/„ = 0 if u G Vi 
and = 0, = 1 if M € C 2 - 

It is easy to see that this assignment makes all but at most k clauses of the 
formula true. 

Note that the reduction is actually a polynomial time reduction. □ 

4 Concluding Remarks 

We have characterized the hereditary properties for which finding an induced 
subgraph with k vertices having the property in a given graph is W[l]- hard. In 
particular, using Ramsey Theorem, we have shown that if the property includes 
all independent sets and all cliques or if it excludes some independent sets as 
well as cliques, then the problem is fixed parameter tractable. However, for some 
of these specific properties, we believe that a more efficient fixed parameter 
algorithms (not based on Ramsey numbers) is possible. 

It remains an open problem to determine the parameterized complexity of 
both the problems stated in Theorem 0 (the ‘n — k bipartite problem’ and the 
‘all but k 2-SAT’ problem). More generally, the parameterized complexity of the 
node-deletion problem for a hereditary property with an infinite forbidden set is 
open. 

Finally we remark that our results prove that the parametric dual of the prob- 
lem considered by Caip (and was proved FPT) is VF[l]-hard. This observation 
adds weight to the conjecture (first made in 1 1 1 )j 1 that typically parametric dual 
problems have complimentary parameterized complexity. It would be interesting 
to explore this in a more general setting. 
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Abstract. We study a modification of digital trees (or tries) with adaptive 
multi-digit branching. Such tries can dynamically adjust degrees of their nodes 
by choosing the number of digits to be processed per each lookup. While we do 
not specify any particular method for selecting the degrees of nodes, we assume 
that such selection can be accomplished by examining the number of strings 
remaining in each sub-tree, and/or estimating parameters of the input 
distribution. We call this class of digital trees adaptive multi-digit tries (or 
AMD-tries) and provide a preliminary analysis of their expected behavior in a 
memoryless model. We establish the following results: 1) there exist AMD-tries 
attaining a constant (0(1)) expected time of a successful search; 2) there exist 
AMD-tries consuming a linear (O(n), n is the number of strings inserted) 
amount of space; 3) both constant search time and linear space usage can be 
attained if the (memoryless) source is symmetric. We accompany our analysis 
with a brief survey of several known types of adaptive trie structures, and show 
how our analysis extends (and/or complements) the previous results. 



1 Introduction 

Digital trees (also known as radix search trees, or tries) represent a convenient way 
of organizing alphanumeric sequences (strings) of variable lengths that facilitates 
their fast retrieving, searching, and sorting (cf. [11,17]). If we designate a set of n 
distinct strings as S ={^j,...,5„} , and assume that each string is a sequence of symbols 

from a finite alphabet Z = {Oj, , |S| = v, then a trie T(S) over S can be 
constructed recursively as follows. If n = 0 ( n = |5| ), the trie is an empty external 
node. If n = 1 (i.e. S has only one string), the trie is an external node containing a 
pointer to this single string in 5 . If w > 1 , the trie is an internal node containing v 
pointers to the child tries: T(Si),...,T(S„) , where each set S- (l</<v) contains 

suffixes of all strings from S that begin with a corresponding first symbol. For 
example, if a string Sj = u -Wj ( u . is a first symbol, and w, is a string containing the 

remaining symbols from s.), and u.=a., then the string w. will go into S-. Thus, 
after all child tries are recursively processed, we arrive at a tree-like 

data structure, where the original strings S ={ij,...,i„} can be uniquely identified by 
the paths from the root node to non-empty external nodes (cf. Fig l.a). 
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A simple extension of the above structure is obtained by allowing more than one 
symbol of input alphabet to be used for branching. Thus, if a trie uses some constant 
number of symbols r > 1 per lookup, then the effective branching coefficient is v ' , 
and we essentially deal with a trie built over an alphabet "L' (cf. Fig.l.c). 
To underscore the fact that branching is implemented using multiple input symbols 
(digits) at a time, such tries are sometimes called multi-digit tries (cf. [2]). 





(a) regular trie (r=l) (b) N-tree (adaptive radix trie) 





(c) multidigit trie (r=2) (d) LC-trie (adaptive multi-digit trie) 

Fig. 1. Examples of tries built from 7 binary strings: sl=0000..., s2=0001..., s3=0010..., 
s4=0100..., s5=0110..., s6=10..., s7=ll... . 



The behavior of regular tries is thoroughly analyzed (cf. [5, 9, 10, 11, 16, 19]). For 
example, it has been shown (cf. [16]), that the expected number of nodes examined 
during a successful search in a v-ary trie is asymptotically 0(logM//i) , where h is the 
entropy of a process used to produce n input strings. The expected size of such trie is 
asymptotically 0(nv//2) . These estimates are known to be correct for a rather large 
class of stochastic processes, including memoryless, Markovian, and y/ -mixed 
models (cf. [16, 2]). In many special cases (e.g. when a source is memoryless, or 
Markovian) the complete characterizations (expectation, variance and higher 
moments) of these parameters have been obtained, and their precise (up to 0(1) term) 

asymptotic expansions have been deducted (cf. [19, 10]). 

Much less known are modifications of tries that use adaptive branching. That is, 
instead of using nodes of some fixed degree (e.g., matching the cardinality of an input 
alphabet), adaptive tries select branching factor dynamically, from one node to 
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another. Perhaps the best-known example of this idea is sorting by distributive 
partitioning, due to Dobosiewicz (cf. [6]). This algorithm (also known as an N-tree 
(cf. Ehrlich [8]) selects the degrees of nodes to be equal exactly the number of strings 
inserted in the sub-tries they originate. For example, the N-tree displayed on Fig.l.b, 
contains seven strings overall, and therefore its root node has seven branches. In turn, 
its first branch receives three strings to be inserted, and thus the N-tree creates a node 
with three additional branches, and so on. 

While N-trees have extremely appealing theoretical properties (thus, according to 
Tamminen [20], they attain a constant (0(l)) expected search time, and use a linear 

0(n) amount of memory), there are several important factors that limit their practical 

usage. The main problem is that the N-tree is not a dynamic data structure. It is more 
or less suitable for a multi-pass construction when all n strings are given, but an 
addition or removal of a string in an existing structure is rather problematic. In the 
worst case, such an operation may involve the reconstruction of the entire tree, 
making the cost of its maintenance extremely high. Somewhat more flexible is a B-b 
parametrized version of the distributive partitioning proposed in [7]. This algorithm 
selects the branching factors to be equal njb ( > 1 ), and split child nodes only when 

the number of strings there becomes larger than B. When both B and b equal one, we 
have a normal N-tree, however, when they are large, the complexity of updates can be 
substantially reduced. 

Unfortunately, this does not help with another problem in adaptive tries. Since the 
degrees of nodes are being selected dynamically, N-trees cannot use simple per- 
symbol lookups. Instead, they must implement dynamic conversions from one size 
alphabet to another, or even treat the input strings as real numbers in [0,1) (cf. [6, 8]). 

In either scenario, the transition between levels in N-trees (and their B-b variants) is 
much slower than one in regular tries, and combined with the complexity of the 
maintenance, it creates serious constraints for the usage of N-trees in practice. 

In this paper, we focus on another implementation of adaptive branching that 
promises to be (at least partially) free from the above mentioned shortcomings. We 
are trying to create an adaptive version of multi-digit tries by allowing the number of 
digits processed by their nodes (parameter r) to be selected dynamically. Due to the 
luck of the standard name, we will call these structures adaptive multi-digit tries (or 
AMD-tries). To cover a wide range of possible implementations of such tries, we (at 
least initially) do not specify any particular algorithm for selecting the depths 
(numbers of digits) of their nodes. Instead, we assume that such selection can be 
accomplished by examining the number of strings remaining in each sub-tree, and 
estimating parameters of the input distribution, and attempt to study the resulting 
class of data structures. The goal of this research is to find performance bounds (in 
both search time and space domains) attainable by AMD-tries, and deduct several 
particular implementations, that can be of interest in practice. 

Somewhat surprising, there were only few attempts to explore the potential of the 
adaptive multi-digit branching in the past. Perhaps the only studied implementation in 
this class is a level-compressed trie (or LC- trie) by Andersson and Nilsson (cf. [2]). 
The node depth selection heuristic in a LC-trie is actually very simple; it combines the 
(v-ary) levels of the corresponding regular trie until it reaches the first external node 
(cf. Fig l.c). It has been shown (cf. [2, 15]), that in a memoryless model this algorithm 
produces nodes with depths r =Er = 0{\ogk-\og\ogk), where k is the number of 
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strings in a sub-trie originated by the node. When source is symmetric, the expected 
search time in a LC-trie is only o(log*n), however, it grows as O(loglogn) in the 
asymmetric case (cf. [15]). 

In our analysis, we extend the above results, and in particular, we show that there 
exist implementations of AMD-tries attaining the constant (0(1)) complexity of a 
successful search in both symmetric and asymmetric cases. Moreover, if source 
distribution is symmetric, such tries can be implemented in linear (0{n ) ) amount of 

space. Compared to an N-tree, an equally fast AMD-trie appears to have a larger 
memory usage, however, it is a much more suitable scheme for dynamic 
implementation, and combined with the benefits of fast multi-digit processing, it 
promises to be a structure of choice for many practical situations. 

This paper is organized as follows. In the next section, we will provide some basic 
definitions and present our main results. The proofs are delayed until Section 3, where 
we also derive few recurrent expressions necessary for the analysis of these 
algorithms. Finally, in our concluding remarks, we emphasize the limitations of our 
present analysis and show several possible directions for future research. 



2 Definitions and Main Results 

In our analysis, we only consider AMD-tries built over strings from a binary alphabet 
X = {0,1} , but the extension to any finite alphabet is straightforward. We also limit our 

study to a situation when n strings to be inserted in a trie are generated by a 
memoryless (or Bernoulli) source (cf. [3]). In this model, symbols of the alphabet X 
occur independently of one another, so that if x . - is the j-th symbol produced by this 

source, then for any j : Pr{x^ = oj = p , and Pr{x . =l} = ^ = l- p.If p-q-\j2, such 

source is called symmetric, otherwise it is asymmetric (or biased). In our analysis, we 
will also use the following additional parameters of memoryless sources: 
h = -plogp-qlogq-, = “log 

^2 = Plog"P + ^log"?; ??-»=- log Pmax; 

where: h is the (Shannon’s) entropy of the source (cf. [3]), and rj „ are special cases 
of the Renyi’s k-order entropy (cf. [18]): = -}-log(p‘*‘ -t = maxjp,^} , 

and p^^ = min{p,i 5 r| . Observe that , p „ , and h have the following relationship: 

rj^<h<rj^ ( 2 ) 

(the equality is attained when the source is symmetric, however in the asymmetric 
case, these bounds are rather weak). 

In this paper, we will evaluate two major the performance characteristics of the 
AMD-tries: the expected time of a successful search and the expected amount of 
memory used by a trie built over n strings. To estimate the search time we can use the 
expected depth or the external path length (i.e. the combined length of paths from 
root to all non-empty external nodes) C„ in a trie = C„/n . To estimate the size of a 
trie we will use the expected number of its branches . Note that the last metric is 
slightly different from one used for the regular tries (cf. [11, 9]). In that case, it was 




152 



Yuriy A. Reznik 



sufficient to find the number of internal nodes 4 , in a trie. However, since internal 
nodes in adaptive tries have different sizes by themselves, we need to use another 
parameter ( ) to take into account these differences as well. 

As we have mentioned earlier, we allow AMD-tries to use an arbitrary (but not 
random) mechanism for selecting the depths of their nodes. However, in a Bernoulli 
model, there are only two parameters that can affect the expected outcome of such 
selection applied to all sub-tries in a trie', a) the number of strings inserted in a sub- 
trie, and b) the parameters of source distribution. Thus, without significant loss of 
generality or precision, we can assume that node-depth selection logic can be 
presented as a (integer-valued) function: 

r„'-r{n,p), (3) 

where n indicates the number of strings inserted in the corresponding sub-trie, and p 
is a probability of 0 in the Bernoulli model. 

We are now ready to present our main results regarding the expected behavior of 
AMD-tries. The following theorem establishes the existence of AMD-tries attaining 
the constant expected time of a successful search. 



Theorem 1. There exist AMD-tries such that: 



where ^^,^2 some positive constants. The upper bound holds if: 


(4) 


and the lower bound holds if: 




(5) 




) = -A(iogn-loglog<Ji) + 0 (i) . 


( 6 ) 


Notice that the above conditions are 


based on parameters T]„and 


, which, in 



turn, can be considered as bounds for the entropy h of the source (2). This may 
suggest that there should be a more accurate way to estimate the depth of nodes 
needed to attain a certain performance (and vice versa), expressed in terms of the 
entropy h. The following theorem answers this conjecture in affirmative. 

Theorem 2. Let 

r^=lognlh, (7) 

and assume that the actual depths of nodes in an AMD-trie are selected to be 
sufficiently close to r ‘ : 

= ( 8 ) 

Then the complexity of a successful search in such trie is (with n — > ~ ): 

(<7 Viog « )) (1 + (1/ " )) ’ (9) 

where: 

is the distribution function of the standard normal distribution (cf.[l]), and cr = ^ 2 ^ . 

Now, we will try to evaluate AMD-tries from the memory usage perspective. The 
next theorem establishes bounds for AMD-tries that attain a linear (with the number of 
strings inserted) size. 
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Theorem 3. There exist AMD-tries such that: 
where ^^,^2 some positive constants. The upper bound holds if: 

^ + i '^-1 )) + 0 ( i ) 

= 7ir(log« - A(w,k- 2,4) - loglog^j) + 
and the lower bound holds if: 

^ i5i2 (i°g" + log'll + )) + * 5 ( 7 ) 

= ^ (log n - A (n, /Cl , ) - log log ) + O ( 
where: /Cj =r]„/log2 , /Cj =r]_„/log2 , 'W_^[x) is a branch Vf (x) < -1 of the Lambert W 
function: VT(x)e"'*** =x (cf. [4]), and 

A(n,/c,^) = log(l + (/flogn-log(n/c))/(/clog§)) . (14) 

Observe that in the symmetric case ( p = ijr = 1/2 ), we have /Cj = /Cj = 1 , and thus, the 
term (14) becomes zero: A(n,l,^) = 0 . The resulting bounds for (12) and (13) will 

be identical to one we have obtained in the Theorem 1 (5), and (6), and thus, we have 
the following conclusion. 

Corollary 4. In the symmetric Bernoulli model, an AMD-trie can attain a 
constant expected depth = C„//r = 0(1) and have a linear size B^=0[n) at the 
same time. 

Unfortunately, in the asymmetric case the situation is not the same. Thus, if k^ \ , 
the term A(n,/c,^) can be as large as O (log log /r) , and the conditions for the constant 
time search (5), (6) may not hold. Actually, from the analysis of LC-tries (cf.. [15]), 
we know that the use of depths /- = 0(log/r-loglogn) leads to the expected search 

time of O(loglogn) , and it is not yet clear if this bound can be improved considering 
the other linear in space implementations of the AMD-tries. 

3 Analysis 

In this section, we will introduce a set of recurrent expressions for various parameters 
in AMD-tries, derive some necessary asymptotic expansions, and will sketch the 
proofs of our main theorems. 

For the purposes of compact presentation of our intermediate results, we will 
introduce the following two variables. A sequence {x„} will represent an abstract 

property of AMD-tries containing n strings, and a sequence {y„} will represent a 
property of root nodes of these tries that contribute to {x„} . The following mapping to 
the standard trie parameters is obvious: 



( 11 ) 

( 12 ) 

(13) 
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Xn \ B„ C„ 

y„ 1 2 '" n 



( 15 ) 



where: \ is the average number of internal nodes, is the average number of 



branches, C„ is the external path length in an AMD-trie containing n strings. 



Using the above notation, we can now formulate the following recurrent 
relationship between these parameters of AMD-tries. 

Lemma 1. The properties jc„ and of an AMD-trie in a Bernoulli model satisfy: 



Xn=yn 



k=2 s=0 



(16) 



where: n is the total number of strings inserted in an AMD-trie, and the rest of 
parameters are as defined in (3) and (16). 

Proof. Consider a root node of depth in an AMD-trie. If n < 2 the result is 0 by 



definition. In n > 2 , the property of trie jc„ should include a property of a root node y„ 



plus the sum of properties of all the child tries. It remains to enumerate child tries and 
estimate probabilities of them having 2<k<n strings inserted. This could be done 
using the following approach. 

Assume that each of the 2'- possible bit-patterns of length has a probability of 



occurrence itt, = Pr {Bin, = /} (Bin,.(Sj) means first bits of a string s^), 



0<i<2'" , ^;7r,=l, and that this is a memory less scheme (itt, is the same for all 
strings). Then, the probability that exactly k strings match the i-th pattern 
is ( f (l - w, )‘^ * . The i-th pattern corresponds to a child trie only if there are at least 
two strings that match it. Therefore, the contribution of the /-th pattern to the property 
x„ is {l-Ttif x^ , and it remains to scan all 2'" patterns to obtain the 

k>2 

complete expression for . 

Recall now, that the actual strings we are inserting in the trie are produced by a 
binary memoryless source with probability of 0 equal p. Therefore, the probabilities 
of the -bit sequences produced by this source will only depend on the number of 0 ’ s 



(^) they contain: tt (s) = p‘ q''- ’. Also, given s zeros, the total number of patterns 
yielding this probability is ( (■ ) . Combining all these formulas, we arrive at 

S ' X, , (17) 

0<.^<r„ k>2 

which, after the addition of y„ yields the claimed expression (16). 

It should be stressed that due to dependency upon the unknown parameter the 
rigorous analysis of a recurrent expression (16) appears to be a very difficult task. It is 
not clear for example, if it is possible to convert (16) to a closed form (for any of the 
parameters involved). Moreover, the attempts to use some particular formulas (or 
algorithms) for can actually make the situation even more complicated. 

The analysis of (16) that we provide in this paper is based on a very simple 
approach, which nevertheless is sufficient to evaluate few comer cases in the behavior 
of these algorithms. Thus, most of our theorems claim the existence of a solution in 
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linear form, and we use (16) to find conditions for such that the original claim 
holds. We perform the first step in this process using the following lemma. 

Lemma 2. Let and be two positive constants such that 

(18) 

Then, the recurrent expression (16) holds when: 

f{n,P,r„)<^2 , (19) 

where: 

,t=0 

Proof. Consider the upper bound in (18) first, and substitute with the 

right side of (16). This yields: 

^ + S ( .? ) X ( t )( '9'" “ ^’ 9 '” " )" ' ^2^ = 

■’=“ ^ (21) 

,t=0 ' ' 

Now, according to (18), x„ <^ 2 ^^ , and combined with (21), the upper bound holds 
only when - ^ 2 / [n,p,rf)<Q . Hence > y„ // (n, p, ) , and repeating this procedure 
for the lower bound ( ), we arrive at formula (19), claimed by the lemma. 

The next step in our analysis is to find bounds for the sum (20) that would allow us 
to separate . The following lemma summarizes few such results. 

Lemma 3. Consider a function f{n,p,rf) defined in (20). The following holds: 



/(”’iT„) = M(l-2 ‘ , (22.a) 

/(n,p,r„)<w(l-e“''"'‘-) < nexp(-(w -l)e“''"'‘- ) , (22. b) 

f (n,p,rf)>n{l-e^'"''~^ > wexp|-(n - l)e“'"‘~^(l , (22. c) 

In addition, if n,r„ , and |r„ -logw// 2 | =o(^logn// 2 ^ , then asymptotically: 

= - log n//i)/(a,yiog n )) (1 + O (l/^log n ) j , ( 22 .d) 

where ®(x) and a are as defined in the Theorem 2 (7-10). 



Proof. The equality for the symmetric case (22. a) follows from by direct 
substitution. Two other bounds (22.b) and (22. c) can be obtained with the help of the 
following estimate: 

g-M. _ „'■» < < p'" 

t imn / T t max ’ 

where p^^ , p ^^ , , and 77 ^ are as defined in (1). We apply these bounds to the last 

component in the sum ( 20 ): 

/ \H— 1 / \«— 1 / „ \H— 1 

(l-e“"”--) <(l-p'^"“) <(l-e“""-) , 

which allows the first portion of sum ( 20 ) to converge: Z(?)pV-’=i. 

The additional (right-side) inequalities in (22.b) and (22.c) are due to: 
g-</(i-) <i-x<e-L 0 <jc<l, (cf. [1], 4.2.29). 
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The derivation of an asymptotic expression (22. d) is a complicated task, and here 
we will rather refer to a paper of Louchard and Szpankowski (cf. [14]) which 
discusses it in detail. A somewhat different approach of solving it has also been 
provided in [12] and [13]. 

Now using the results of the above lemmas, we can sketch the proofs of our main 
theorems. Consider the problem of evaluating the behavior of AMD-tries from the 
expected complexity of a successful search perspective first. 

We can use a recurrent expression (16) where, according to (15), we can substitute 
jc„ with a parameter of the external path length C„ , and y„ with n. Observe that the 
bounds for C„ (4) claimed by the Theorem 1 and condition (18) in Lemma 2 are 
equivalent, and the combination of the result of this lemma (18) with ( 22 .b) and ( 22 .c) 
yields the following: 

< (l - /"<«// («, p, ) < (l - )' " < ^2 • (23) 

Here, it remains to find an expression for such that 1 < <?1 ^ ^2 < “ regardless of n . 
Considering the right part of 18 (upper bound), the last requirement is obviously 
satisfied when e ^ . More precisely, for any given constant we need: 

'■« - TT (l - exp (^^)) = ^(log M - log log ^2 ) + O (i) , 

which is exactly the condition (5) claimed by the Theorem (1). The expression for the 
lower bound ( 6 ) is obtained in a similar way. 

The result of the Theorem 2 follows directly from (19), (20) and asymptotic 
expression ( 22 . d). 

To evaluate the expected size an AMD-trie in memory we will use a very similar 
technique. Consider a recurrent expression (16) withjc„ substituted by the expected 



number of branches , and with 2 '" . Following the Theorem 3 and Lemma 2, we 
will assume that is bounded ( 11 ), and using (18) combined with ( 22 . b) and ( 22 . c) 
we arrive at: 



<J, < T" n' (l - y " < 2''"// (w, P, r„ ) < 2'’ n' (l - e"''""- )‘ " < <?2 ■ (24) 

Compared to (23) this appears to be a slightly more complicated expression, which 
does not yield an exact solution for (unless the source is symmetric). To find an 
asymptotic (for large n) solution of (24), we will write: 






1-exp 



-log(|2«2 






V V yj 

and after some manipulations we arrive at: 

'■» + + (25) 

where according to ( 12 ) ^2 = 77 ^ /log 2 , and W_^{x) is a branch lT(x)<-lof the 
Lambert W function = jc (cf. [4]). To perform further simplifications in 

(25), we can use the following asymptotic expansion of (cf. [4]): 



IT 1 (x) = log (-x) - log (-log (-x)) + < X < 0 , 



(26) 
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which, after some algebra yields the second part of inequality (12) claimed by the 
Theorem 3. The expression for the lower bound (13) is obtained in essentially the 
same way. 



4 Concluding Remarks 

We absolutely believe that AMD-tries have great potential that should be explored in 
practice. To that end, our conclusion that they attain 0(1) expected search time, while 
preserving all the benefits of the regular tries, should be a good starting point. 

At the same time, AMD-tries pose several interesting theoretical problems, and 
from this perspective, our results just scratch the surface. Our analysis was only 
sufficient to produce (rather coarse) bounds for the expected characteristics of AMD- 
tries, and the derivation of their exact expressions (including lower-magnitude terms 
and oscillating components) remains an open (and difficult) problem. 

Another interesting problem is to find parameters of an AMD-trie that uses the 
minimum possible amount of memory. Thus, analyzing the set of expressions in the 
proof of the Theorem 3 we can conjecture that such trie exists, but finding its actual 
parameters requires a more solid analytical framework. 
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Abstract. Ulam and Renyi asked what is the minimum number of yes- 
no questions needed to find an unknown m-bit number x, if up to I 
of the answers may be erroneous/mendacious. For each i it is known 
that, up to only finitely many exceptional m, one can find x asking 
Berlekamp’s minimum number qe{m) of questions, i.e., the smallest inte- 
ger q satisfying the sphere packing bound for error-correcting codes. The 
Ulam-Renyi problem amounts to finding optimal error-correcting codes 
for the binary symmetric channel with noiseless feedback, first considered 
by Berlekamp. In such concrete situations as optical transmission, error 
patterns are highly asymmetric — in that only one of the two bits can be 
distorted. Optimal error-correcting codes for these asymmetric channels 
with feedback are the solutions of the half-lie variant of the Ulam-Renyi 
problem, asking for the minimum number of yes-no questions needed 
to find an unknown m-bit number x, if up to I of the negative answers 
may be erroneous /mendacious. Focusing attention on the case £ = 1, in 
this self-contained paper we shall give tight upper and lower bounds for 
the half-lie problem. For infinitely many m’s our bounds turn out to be 
matching, and the optimal solution is explicitly given, thus strengthening 
previous estimates by Rivest, Meyer et al. 



1 Introduction 

In his autobiographical book “Adventures of a Mathematician” p~fl p.281], Ulam 
posed the problem of optimal binary search in the presence of faulty tests. Inde- 
pendently, the very same problem had been formulated by Renyi jSl p.47] . In fact, 
the search of an unknown m-bit number by asking the minimum number of “yes- 
no” questions, when up to £ of the answers may be erroneous/mendacious, is the 
same as the problem of finding shortest Uerror-correcting codes for Berlekamp’s 
noiseless delayless feedback channel Q . 

As a typical example, let us consider a Transmitter sending binary messages 
(e.g., m-bit numbers) on a noisy channel. Assume that the received bits can 

D.-Z. Du et al. (Eds.): COCOON 2000, LNCS 1858, pp. ISO-ESHI 2000. 

(c) Springer- Verlag Berlin Heidelberg 2000 
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be sent back to the Transmitter on a noiseless channel. Thus Transmitter-to- 
Receiver bits are interleaved with Receiver-to-Transmitter bits. In other words, 
before sending the (i + l)th bit, the Transmitter does know which ones among 
the previously sent i bits b have been distorted and received as 1 — 6 because of 
the noise effect. The main problem here is to minimize the number of bits sent 
by the Transmitter, while still guaranteeing that the m-bit number can be fully 
recovered by the Receiver. 

Assuming the 1 and 0 bits to be equally subject to distortion, it is known that 
an optimal solution to this problem is given by sending qe(rn) bits, where qe(rn) 
is the smallest integer q satisfying the Sphere Packing Bound J2j-o (]) — 2'^“’”. 
Trivially, no solution exists using less than qi{m) bits. 

In other concrete situations the distribution of errors/distortions is highly 
asymmetric: for instance, in optical communication systems photons may fail to 
be detected (1^0), but the creation of spurious photons (0^1) is impossible 0. 
Similarly, in most LSI memory protection schemes one can safely assume that 
only one of the two possible error patterns (1^0) and (0^1) can occur 0. 

Optimal error-correcting codes for these asymmetric channels with feedback 
are the same as shortest binary strategies to find an m-bit number, when only 
negative tests can be erroneous. 

To the best of our knowledge, the first paper dealing with this “half-lie” vari- 
ant of the Ulam-Renyi problem was 0 . The authors proved an asymptotic result 
to the effect that the half-lie game has the same complexity as the original Ulam 
problem — henceforth referred to as the “full- lie” game. They also proved that the 
half-lie variant of the Ulam-Renyi game with comparison questions corresponds 
to the problem of finding the minimum root of a set of unimodal functions. 

Focusing on the case i = 1, in this paper we improve the result of 0, by 
giving very tight and non-asymptotic, lower and upper bounds for the half-lie 
game. More precisely, we prove that 

— for each m, qi{m) — 1 questions suffice; 

— for each m, qi{m) — 3 questions do not suffice; 

— for infinitely many m, qi{m) — 2 questions do not suffice, and we give an 
optimal strategy using exactly q\{m) — 1 questions; 

— for infinitely many m, qi{rn) — 2 questions suffice, and we give a (necessarily 
optimal) strategy using exactly qi{m) — 2 questions. 

2 Preliminaries 

We shall consider a game between two players, Paul (the Questioner), and Carole 
(the Responder). The rules are as follows: Paul and Carole first fix an integer m, 
and a search space E coinciding with the set of m-bit integers E = {0, 1, . . . , 2"* — 
1}. Now Carole chooses a number x in E, and Paul must find x by asking 
questions, to which Carole can only answer yes or no. Carole can lie at most 
once, but only by answering “no” to a question whose correct (sincere) answer 
is “yes”. Thus, only Carole’s negative answers are dubious for Paul. This is the 
the half-lie game over the search space E. 
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Paul’s state of knowledge is completely represented by a pair {Aq,A{) of 
disjoint subsets of E, where Aq (the truth-set) is the set of those elements of E 
satisfying all of Carole’s answers, and Ai (the lie-set) is the set of elements of E 
falsifying exactly one (necessarily negative) answer. By a question we understand 
any arbitrary subset T C E. Thus T represents Paul’s question “Does the secret 
number x belong to T?” . 

In particular, Paul’s initial state of knowledge, before any question is asked, 
is represented by the pair (if, 0). At any stage of the game, suppose that Paul’s 
state of knowledge is given by <t = (Ao,Ai), and his next question is T. If 
Carole’s answer is “yes” (whence, by hypothesis, this answer is correct) then the 
resulting state of knowledge of Paul is given by 

= (Aonr,AinT). 

On the other hand, if Carole’s answer is “no”, Paul’s new state of knowledge 
is given by 

a"° = (Ao\T, (Aonr)U(Ai\T)). 

This is so because the lie-set of cr”° has two kinds of elements: (i) members of 
the truth-set of cr that falsify Carole’s answer, and (ii) members in the lie-set of 
cr that satisfy Carole’s answer. 

The game is over when Paul’s state cr = (Aq, Ai) satisfies |Aq U Ai| < 1, 
where | • | denotes cardinality. As a matter of fact, if |Ao U Ai| = 1 then the only 
element of Aq U Ai must coincide with Carole’s secret number. On the other 
hand, if |Ao U Ai| = 0 then every element x G E falsifies at least two of Carole’s 
answers — against the rules of the game. 

Our main concern here is to estimate the minimum number of questions 
needed to find x, rather than actually exhibiting such questions. Accordingly, 
only the number of elements of Aq and Ai is relevant here, and we can safely 
represent Paul’s state (Aq, Ai), by the pair of integers (|Ao|, |Ai|). Similarly, any 
question T C E will be conveniently represented by the pair of integers [to,ti], 
where U = \Ai n T\. 

Using this notation, for any state cr = (a;o,a:i) and question [toAi] the two 
possible states resulting from Carole’s answer are given by 

= (tQ,ti) and cr”° = (xo - to, to -\- xi - h) (1) 



Definition 1. A final state is a state a = (xo,a;i) such that xq + x\ < 1. 

Starting with the state cr, suppose that questions T\, . . . ,Tt have been asked, 
and a corresponding t-tuple of answers b = bi, . . . ,bt, has been received. Iterated 
application of the above formulas yields a sequence of states ao,ui, . . . ,at, where 

(To = cr, (Ti = (Job (T 2 = crj’', . . . , at = (2) 

By a strategy S with q questions we mean the full binary tree of depth q, 
where each node v is mapped into a question and the two edges rjieft^riright 
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below V are respectively labeled yes and no. Let r] = 771 , . . . , be a path in 
S, from the root to a leaf, with respective labels bi,...,bq, generating nodes 
ui,...,Vq and associated questions . . . ,T^^. Fixing an arbitrary state a, 
iterated application of m naturally transforms cr into (where the dependence 
on the 5j’s and T,^ ’s is understood). We say that strategy S is winning for a iff 
for every path rj from the root to a leaf, the state is final. 

Definition 2. We say that a = (a;o,a^i) is a winning g-state ijf there exists a 
strategy with q questions which is winning for a. 

Trivially, cr is a winning g-state iff there exists a question T such that both 
resulting states and cr"° are winning (g — l)-states. 

Definition 3. We say that cr' = {yo, y\) is a substate of a state a = (xq, xi) iff 
Uj < Xj, for each j = 0, 1. 

Lemma 1. Let a = {xo,x\) he a winning q-state. Then every substate of a is a 
winning q-state. 

To help the reader, we shall recall here two basic definitions concerning the 
full-lie game im. 

Definition 4. For every integer m > 1, Berlekamp’s minimum number of ques- 
tions q{m) is defined by q(jn) = min{g | g -I- 1 < 2'^“'"}. 

Definition 5. For every integer s > 0 the sth critical index ms is defined by 
ms = max{77i | m -I- s -I- 1 < 2®}. 

It turns out |S| that g(m) is the minimum number of questions that are not 
only necessary, but also sufficient to solve the full-lie game with one lie over the 
search space of cardinality 2"*. The critical index TOs is the largest integer m 
such that m -|- s questions are necessary and sufficient to solve the full- lie game 
with one lie over a search space of cardinality 2"*. 

3 Lower Bounds 

Definition 6. Fix two integers m = 1,2,3,... and q > m. Then the integer 
K{m,q) is the smallest k such that (?) ^ integer p = p{m,q) is 

defined by p = 2'"' - (?) • 

Trivially, 1 < k(to, g) < g and 0 < p < ■ 

Theorem 1. Fix an integer m = 1, 2, 3, . . .. Let q^ be the smallest integer q such 
that 
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Note that 1 < < q{m). Assume there exists a winning strategy with q* ques- 

tions to find an unknown integer in the set E = {0, 1, . . . 2"* — 1} in the half-lie 
game, i.e., when up to one of the “no” answers may be mendacious and all “yes” 
answers are true. Then q* > q^ ■ 

Proof. Let S be such strategy, where we can safely assume that the number q* 
of questions in S is the smallest possible. By a path in S we shall understand 
a path from the root to a leaf of S. For each x G E there exists precisely one 
path fj,x in S leading to the final state ({a;},0). This final state is obtained if 
Carole chooses x as the secret number and then decides to give true answers to 
all q* questions. Let southwestx be the number of left branches (i.e., branches 
whose label is “yes”) in this path. The southwestx many branches of qix are 
a record of Carole’s “yes” answers, once she decides to choose x as the secret 
number, and to always tell the truth. If, on the other hand, Carole decides to 
give one mendacious answer, she can deviate from this path in southwestx many 
ways, replacing a “yes” true answer by a mendacious “no” answer. 0 Since S is 
a winning strategy, there exist in S precisely southwestx many paths leading to 
the final state (0, {a:}). Furthermore, whenever x ^ y G E the state (0, {?/}) will 
be obtained by a different path. Now, each path in S has q* branches, and paths 
are in one-one correspondence with subsets of {!,..., g*} (giving the depths 
of their left branches) . To obtain a set of 2™ paths having the smallest possible 
total number of left branches, with reference to Definition one must proceed as 
follows: first take the only path in S with no left branch, then the q* paths with 
1 left branch, the ) paths with 2 left branches, . . . , the with 

n(rn,q*) — 1 left branches, and finally, p{m,q*) many paths with K(m,q*) left 
branches. Then the total number A’once of possibilities for Carole to lie precisely 
once in S will satisfy the inequality 



d^once ^ ^ ( i 

i=0 

On the other hand, if Carole decides not to lie, she can still choose x G E, and 
the corresponding path leading to the leaf ({a;}, 0), in 2™ ways. Summing up, the 
2'J many paths of the winning strategy S cannot be less than the total number 
of Carole’s answering strategies, whence 



+ K{m,q*)p{m,q*). 



29* > 2™ + IVonce > £ i 

i=0 

We shall now prove that, for all integers m > 1, at least g(m) — 2 ques- 
tions are necessary to find a number in the half-lie game over the search space 
{0,1,. ..,2”^ — 1}. We prepare the following 

^ Note the effect of the assumed asymmetry in Carole’s lies: in the classical Ulam game 
Carole can choose to lie in q* many ways. 



-I- K{m, q*)p{m, q*). 
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Lemma 2. Let m = 2, 3, . . . . Let the integer q have the property that there exists 
a winning strategy S for the state a = (2"*, 0) with q questions. Then there exists 
a winning strategy for the state a' = 0) with q — 1 questions. 

Proof. Let T = [tg, 0] be the first question in S. The resulting states are = 
(to,0) and cr”° = {2^ — to, to), both having a winning (g— l)-strategy. According 
as to > 2™“^ or to < 2™“^, the state (2"*“^,0) is either a substate of or 
a substate of cr"°, respectively. In either case, (2'"“^,0) has a winning (g — 1)- 
strategy. 

Notation: For each real number t we let [tj (resp., [t] ) denote the largest integer 
< t (resp., the smallest integer > t). For each integer r > 0, we let power set^ 
denote the powerset of the set {l,2,...,r}. For any family 0 of subsets of 
{1, 2, . . . , r} we let 

card{0) 

denote the average cardinality of the sets in O. For all real numbers 0 < a < b < r 
we let 

powersetr[a, b] = power setr[\a ~\ , [5J] = {G C {1, 2, . . . , r} \ a < \G\ < b} . 
Given an integer 0 < k < 2’', the initial k-segment 

power setr \ k 

of powersetr is the family of the first k smallest subsets of {1, 2, . . . , r}, i.e., the 
0-element subset, the r many 1-element subsets, the ( 2 ) many 2-element subsets, 

. . . , until k subsets have been collected having the smallest possible cardinality. 

Lemma 3. The following inequalities hold: 

(i) For all q > 57, \powersetg[0, q/3] \ < 

(a) For all q > 171, \powersetqf),2q/L\\ < 

(Hi) For all q > 12, card{powersetq, [0,g/3]) > 

(iv) For all q > 23, card{powersetq, [0,2g/5]) > |. 

Proof. A routine inspection, using the properties of the binomial coefficients. 



Theorem 2. For each m > 1, let q{m) be Berlekamp’s minimum number of 
questions. Then at least q{m) — 2 questions are necessary to win the half-lie 
game (with one lie) over the search space {0, 1, . . .2"* — 1}. 

Proof. First we consider the case 1 < m < 50. Let ghaif-iie(™) denote the length 
of the shortest winning strategy for the half-lie game (with one lie) over the 
search space {0, 1, ... 2™ — 1}. By Definition 0 we have toq = 0 and me = 57. 
Hence for each 1 < to < 50 we have mo 1 < m < me. For all 0 < s < 5, a 
direct inspection using Theorem H yields ghaif-iie(?7rs + 1) > q^m-s -I- 1) — 2. By 
Lemma El the desired result now follows for all to^ -|- 1 < to < TOs+i. 
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We now consider the case m > 51. It follows that q{m) > 57. Writing q 



instead of q{m), we have 




29-1 ^ ^2"*, 


(3) 


whence, a fortiori. 




29-3 ^ 2'"(l-Hg/4). 


(4) 


By Lemma El) iii) we get 




29-3 ^ 2™(1 -1- card{powersetq[Q,q/'fi\). 


(5) 



From Lemma 0(i) it follows that 




( 6 ) 



Since by (0) <2™, the monotonicity properties of the binomial coefficients, 

together with inequality imply 

card{powersetq[0, q/3]) = card{powersetq[0, [g/3j]) < card{powersetq |'2"*). 
Recalling Onj we can write 

2?-3 ^ 2'"(1 + card{powersetq I'2™)). (7) 

A moment’s reflection shows that 



card{power setq |'2'") < card{powersetq-^ |'2"*). (8) 



Let 5 be a strategy with q — 3 of questions (absurdum hypothesis). As in 
the proof of Theorem ^ let A^once be the total number of possibilities for Carole 
to lie precisely once in S : this is the same as the number of left branches in the 
totality of paths of depth g — 3 leading to a final state ({a;},0), for all possible 
a; S if. In the light of (0-®, and recalling the estimates in the proof of Theorem 
E we can now write in a self-explanatory notation 



2 ? 3 ^ 2"*(1 -I- card{powersetq -3 |'2"*)) = 2'" -|- 2' 



I Z^powersetq — 3 \2'^ ^ \ j ' 






power setq — 2 f'2” 

which is impossible. 



Jq-i 

j 



< 2'" -h iVo 



^^powersetq — 3 ['2 

(m = 51,52,53,...), 



Theorem 3. With reference to Definitions^ andQ fix an integer s > 1, and 
let m = iris and q = q{iTis). Then at least q — I questions are necessary to solve 
the half-lie game with one lie, over the search space of cardinality 2"*. 
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Proof. By definition, m = TOs = 2® — s — 1, and q = q{m,s) = m + s = 2® — 1. For 
1 < s < 8, direct verification shows that the integer q^fuis) defined in Theorem^ 
coincides with (/(mg) — 1, and hence the thesis is a direct consequence of Theorem 

III 

Let us now consider the case s > 9. Suppose that there exists a winning 
strategy for Paul using q — 2 questions (absurdum hypothesis). In the light of 
Theorem ^ in order to reach a contradiction, it is sufficient to show 

2® < 4(1 + card{powersetq -2 |'2"*)). (9) 

Claim: card{powersetq -2 \ 2'")) > m/3. 

As a matter of fact, for all s > 9, by Lemma|3iv) and (ii) we have m/3 < q/3 < 
card{powersetq[0,2q/5]) = card{powersetq[0, [2q/5j]) < card{powersetq 

It follows that m/3 < card{powersetq-2 \ Si’^ce for all s > 4, = 

1 i( 2 °-i) ^ ^ 2 ‘ ~ 2^°“®“^ = 2"*, we then get m/3 < card{powersetq -2 |'2"*) 

and the claim is proved. 

From equation m we now see that, to complete the proof of the theorem it 
is sufficient to settle the inequality 2® < 4(l + m/3). Since m = 2® — s — 1, this 
inequality trivially holds for all integers s > 1. 

4 Upper Bounds 

For all m > 1, we shall construct a winning strategy to solve the half- lie game 
over the search space {0, 1, . . . , 2"* — 1} using no more than q{m) — 1 questions. 
Moreover, we shall show that for infinitely many m this strategy uses no more 
than q{m) — 2 questions. 

Theorem 4. For every integer m > 1, let a = (2"*,0). Then there exists a 
winning strategy for a in the half-lie game, using exactly m — 1 -\- |"log(m -I- 3)] 
questions. 

Proof. The proof proceeds through several claims: 

Claim 1: For each integer r > 1, the state cr = (1,2'' — !) has a winning r-strategy. 

By induction on r. For r = 1, we have a = (1, 1). Upon asking the question 
[0,1], we obtain the two final states (0,1) and (1,0). For the induction step, 
suppose the claim is true for r > 1, with the intent of proving it for the state 
(1, 2’'+^ — 1). Let cr = (1, 2''+^ — 1) and ask the question [0, 2’']. We then obtain 
the two states cr^®® = (0, 2’’) and cr"° = (1, 2® — 1). It is easy to see that cr^®® is 
a winning r-state. Moreover, cr"® is a winning r-state by induction hypothesis. 
Therefore, cr is a winning (r -|- l)-state, and the claim is settled. 

Claim 2: For each integer r > 1, the state cr = (2, 2’’ — 3) is a winning r-state. 

Upon asking question [1, 2’'“^ — 1], the two resulting states are cr*^®® = cr”° = 
(1, 2’’“^ — !). By Claim 1, both states have a winning strategy with r— 1 questions. 
A fortiori, there exists a winning strategy with r questions for the state (2, 2''— 3). 
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Claim 3: For every integer to > 1, the state a = (2, to) has a winning strategy 
with [log(TO + 3)] questions. 

As a matter of fact, let r = |"log(TO + 3)] . Then r = min{i | 2*“^ — 3 < to < 
2* — 3}. Therefore, cr is a substate of (2,2’' — 3) and the desired result directly 
follows, by Claim 2 and Lemma ^ 

Claim For any two integers to > 1 and fc = l,2,...,TO, there exists a winning 
strategy for the state a = (2^, (to — fc + 1)2^“^) , using exactly k—l+ |’log(TO + 
3)] questions. 

We proceed by induction on k. The basis is a direct consequence of Claim 
3. For the induction step, let k = i + 1. Then tr = (2*+^, (to — z)2*) . By asking 
the question [2*, (to — i + 1)2’“^] , the two resulting states coincide — specifically, 
^yes _ ^no _ (^2*, (to — i + 1)2’“^) . By induction one immediately sees that 
(jves _ ^no jg winning (j — 1 + [log(TO + 3)] )-state. Therefore, cr is a winning 
{k — 1+ |'log(TO + 3)] )-state, as required to settle the claim. 

End of the Proof of Theorem^ Let a = (2’",0). Then cr is a substate of 
(2™, 2’"“^) and we have the desired result by Claim 4 and Lemma ^ 

In the following two lemmas, upper bounds on the number of questions in the 
shortest winning strategy for the half-lie game over a search space of cardinality 
2™, are given in terms of Berlekamp’s minimum number q(m). 

Lemma 4. For every integer m > 1, let E = {0,1,..., 2’" — 1}. Then there 
exists a winning strategy to solve the half-lie game over E, using < q{m) — 1 
questions. 

Proof. By Theorem Elthere exists a winning strategy using to — 1 -|- |"log(TO -1-3)] 
questions. Then we need only show that for all to > 1, 

TO — 1 -I- [log(TO -I- 3)] < q{m) — 1. 

Let k = m — 1 [log(TO -I- 3)] . By definition of Berlekamp’s number q{m), it is 
sufficient to settle the inequality A: -|- 1 > 2^“™, that is, 

m+ {log(TO-h3)l > 2r'°s("’+3)l-i^ 

To this purpose, let s > 0 be the smallest integer such that to = 2® — t for some 
0 < t < 2®“^. Then we argue by cases: 

Case 1: t > 3. Then (IIOII becomes 2® — t-l-s > 2®“^. This latter inequality holds 
true, because 2®“^ > t > 3 implies s > 2, whence 2®“^ s > t. 

Case 2: 0 < t < 2. Then iTTO becomes 2® — t -|- s -I- 1 > 2®, or equivalently, 

s + l>t. (11) 

Since 2®“^ > t, the only possible cases to be considered are 

t = 0, s>0; t = l, s>2; t = 2, s > 3. 

In every case, inequality (I I III is satisfied and the proof is complete. 
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Lemma 5. Fix integers n > 1 and S < t < n. Let to = 2” — L Then there exists 
a winning strategy for the state a = (2™, 0) using < q{m) — 2 questions. 

Proof. Let k = m — 1 + [log(TO + 3)] . By Theorem 0 there exists a winning 
strategy for a using k questions. Then we need only prove that k < q(rn) — 1. 
Recalling the definition of q{m), it is enough to settle the inequality fc + 2 > 
2^+1“™, that is, TO + 1 + |"log(TO + 3)] > From our hypotheses 

to = 2" — t<2" — 3 and t < n < 2"“^, we obtain [log(TO + 3)] = n. It follows 
that TO + 1 + [log(TO + 3)] = 2" — t + 1 + n > 2" = 2r*°s("*+3)] ^ required to 
conclude the proof. 

5 Conclusion: Main Results 

Theorem 5. For eaeh m = 1,2,..., let q(m) be the (Berlekamp) minimum 
integer q such thatq+1 < 2'^“"*. Let ghaif-Ue(w) denote the number of questions 
of the shortest winning strategy for the half-lie game over the search space E = 
{0, 1, . . . , 2™ — 1}. Let the sets of integers C and Eg (s = 1, 2, 3, . . .) be defined by 
C = {to > 0 I q{m) = — 1} and Eg = {to > 0 | TOs + 1 < rn < TOs + s — 2}. 

Then 

(A) qfm) - 2 < ghaif-iie(m) < q{m) - 1, for each to = 1, 2, 

(B) ghaif-iie(m) = q{m) - 1, for all uiGC. 

(GJ ghaif-iie(w) = q{m) — 2, for each s = 1,2, .. . and m G Eg. 

Proof. (A) By Lemma^and Theorem|3 (B) By Lemma0|and Theorem 0 (C) 
By Theorem 12 and Lemma |2 
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Abstract. In effective analysis, various classes of real numbers are dis- 
cussed. For example, the classes of computable, semi-computable, weakly 
computable, recursively approximable real numbers, etc. All these classes 
correspond to some kind of (weak) computability of the real numbers. 
In this paper we discuss mathematical closure properties of these classes 
under the limit, effective limit and computable function. Among others, 
we show that the class of weakly computable real numbers is not closed 
under effective limit and partial computable functions while the class of 
recursively approximable real numbers is closed under effective limit and 
partial computable functions. 



1 Introduction 

In computable analysis, a real number x is called computable if there is a com- 
putable sequence (xn)neN of rational numbers which converges to x effectively. 
That is, the sequence satisfies the condition that jxn — a;| < 2“”, for any n G N. 
In this case, the real number x is not only approximable by some effective proce- 
dure, there is also an effective error-estimation in this approximation. In practice, 
it happens very often that some real values can be effectively approximated, but 
an effective error-estimation is not always available. To characterize this kind 
of real numbers, the concept of recursively approximable real numbers is intro- 
duced. Namely, a real number x is recursively approximable (r.a., in short) if 
there is a computable sequence (x„)„gN of rational numbers which converges to 
x. It is first noted by Ernst Specker in US] that there is a recursively approx- 
imable real number which is not computable by encoding the halting problem 
into the binary expansion of a recursively approximable real numbers. 

The class Ce of computable real numbers and the class C^a of recursively 
approximable real numbers shares a lot of mathematical properties. For exam- 
ple, both Ce and C^a are closed under the arithmetical operations and hence 
they are algebraic fields. Furthermore, these two classes are closed under the 
computable real functions, namely, if x is computable (r.a.) real number and / 
is a computable real function in the sense of, say, Grzegorczyk |S|, then f{x) is 
also computable (resp. r.a.). 

The classes of real numbers between Ce and C^a are also widely discussed 
(see e.g. |12li;iiII2llHj ). Among others, the class of, so called, recursively enu- 
merable real numbers might be the first widely discussed such class. A real 

D.-Z. Du et al. (Eds.): COCOON 2000, LNCS 1858, pp. 1 TO- TTni 2000. 
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number x is called recursive enumerable if its left Dedekind cut is an r.e. set 
of rational numbers, or equivalently, there is an increasing computable sequence 
of rational numbers which converges to x. We prefer to call such real 
numbers left computable because it is very naturally related to the left topology 
T< := {(a;oo) : a € IR} of the real numbers by the admissible representation 
of Weihrauch m- Similarly, a real number x is called right computable if it is 
a limit of some decreasing computable sequence of rational numbers. Left and 
right computable real numbers are called semi- computable. Robert Soare 
discusses widely the recursion-theoretical properties of the left Dedekind cuts of 
the left computable real numbers. G. S. Ceitin ^ shows that there is an r.a. real 
number which is not semi-computable. Another very interesting result, shown by 
a series works of Chaitin |S| , Solovay Calude et al. |2| and Slaman says 
that a real number x is r.e. random if and only if it is an 17-number of Chaitin 
which is the halting probability of an universal self-delimiting Turing machine. 
We omit the details about these notions here and refer the interested readers to 
a nice survey paper of Calude P) . 

Although the class of left computable real numbers has a lot of nice prop- 
erties, it is not symmetrical in the sense that the real number —x is right 
computable but usually not left computable for a left computable real num- 
ber X. Furthermore, even the class of semi-computable real numbers is also not 
closed under the arithmetical operations as shown by Weihrauch and Zheng CHI 
Namely, there are left computable real numbers y and z such that y — z is neither 
left nor right computable. As the arithmetical closure of semi-computable real 
numbers, Weihrauch and Zheng CHI introduces the class of weakly computable 
real numbers. That is, a real number x is weakly computable if there are two left 
computable real numbers y and z such that a; = y — z. It is shown in CHI that x is 
weakly computable if and only if there is a computable sequence which 

converges to x weakly effectively, i.e. lim^^oo = x and 1^" ~ Xn+i\ is 

finite. By this characterization, it is also shown in that the class of weakly 
computable real numbers is an algebraic field and is strictly between the classes 
of semi-computable and r.a. real numbers. In this paper we will discuss other 
closure properties of weakly computable real numbers for limits, effective limits 
and computable real functions. We show that weakly computable real numbers 
are not closed under the effective limits and partial computable real functions. 
For other classes mentioned above, we carry out also a similar discussion. 

At the end of this section, let us explain some notions at first. For any set 
A C N, denote by xa '■= '^n^A number whose binary expansion cor- 

responds to set A. For any /c G N, we define kA := {fcn : n G A}. For any function 
/ : N ^ N, a set A is called /-r.e. if there is a computable sequence (A„)„gN of 
finite subsets of N such that A = UigN Flj>i Aj and |{s : n G AgAAg +i}| < f{n) 
for all n G N, where AAB := (A\B) U (B\A). If f {n) := k is & constant func- 
tion, then /-r.e. sets are also called fc-r.e. A is called oj-i.e. iff there is a recursive 
function / such that A is /-r.e. 
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2 Computability of Real Numbers 



In this section we give at first the formal definition of various versions of com- 
putability of real numbers and then recall some important properties about these 
notions. We assume that the reader familiar the computability about subsets of 
the natural nmbers N and number-theoretical functions. A sequence (a;n)nGN of 
rational numbers is computable iff there are recursive functions a,b,c: N ^ N 
such that Xn = {a{n) — b{n))/{c{n) + l). We summarize the computability notions 
for real numbers as follows. 



Definition 1. For any real number a; G IR, 

1. a: is computable iff there is a computable sequence (a;„)„gN of rational 
numbers such that x = lim„^oo Xn and Vn (|a;„ — a;„+i| < 2“”). In this case, the 
sequence {xn)neN is called fast convergent and it converges to x effectively. 

2. a; is left (right) computable iff there is an increasing (decreasing) com- 
putable sequence (xn)neN of rational numbers such that x = lim„^ooa^n- Left 
and right computable real numbers are all called semi- computable. 

3. x is weakly computable (w.c. in short) iff there is a computable sequence 

{xn)neN of rational numbers such that x = lim„^oo Xn and ~ Xn+i \ is 

finite. (a;„)„gN is called converging to x weakly effectively. 

4. X is recursively approximable (r.a., in short) iff there is a computable se- 
quence (a;„)„gN of rational numbers such that x = lim„^oo Xn. 

The class of computable, left computable, right computable, semi-computable, 
W.C., r.a. real numbers is denoted by Cg, C/c, Crc, Csc, Cwc, C^a, respectively. 

As shown in PH| , the relationship among these classes looks like the following 



Ce = c,,nc 



^ Cl, 

rc ^ ^ 

\~^r( 



c Cgc — C/c U C,c C Cuic C Cr 



Note that in above definition, we define various versions of computability 
of real numbers in a similar way. Namely, a real number x is of some version 
of computability iff there is a computable sequence of rational numbers which 
satisfies some special property and converges to x. For example, if P/c[(a;„)] 
means that (x„)„gN is increasing, then x G C/c iff there is a computable sequence 
(a:„)„gN of rational numbers such that Pic[{xn)] and lim^^oo Xn = x. In general, 
for any reasonable property on sequences, we can define a corresponding class of 
real numbers which have some kind of (weaker) computability. This can even be 
extended to the case of sequences of real numbers as in the following definition. 

Definition 2. Let P be any property about the sequences of real numbers. Then 

1. A real number x is called P -computable if there is a computable sequence 
(cc„)„gN of rational numbres which satisfies property P and converges to x. The 
class of all P-computable real numbers is denoted by Cp 

2. A sequence (xn)rtGN of real numbers is called P-eomputable, or it is a 
eomputable sequence o/ Cp iff there is a computable double sequence {rnm)nmeN 
of rational numbers such that (r„m)mGN satisfies P and lim^^oo Xnm = Xn for 
all n G N. 
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3. The class Cp is called ^‘‘closed under limits" , iff for any computable se- 
quences of Cp, the limits x := lim„^oo is also in Cp whenever 

{xn)neN satisfies P and converges. 

4. The class Cp defined in 2. is called “closed under effective limits", iff 

for any fast convergent computable sequences of Cp, the limits x := 

lim„^oo Xn is also in Cp. 

Now we remind the notion of computable real function. There are a lot of ap- 
proaches to define the computability of real functions. Here we use Grzegorczyk- 
Ko-Weihrauch’s approach and define computable real function in terms of “Type- 
two Turing Machine” (TTM, in short) of Weihrauch. 

Let S be any alphabet. E* and are sets of all finite strings and infinite 
sequences on E, respectively. Roughly, TTM M extends the classical Turing ma- 
chine in such a way that it can be inputed and also can output infinite sequences 
as well as finite strings. For any p G 27* U E°° , M{p) outputs a finite string q, 
if M(j>) writes q in output tape and halt in finite steps similar to the case of 
classical Turing machine. M(p) outputs an infinite sequence q means that M(p) 
will compute forever and keep writing q on the output tape. We omit the formal 
details about TTM here and refer the interested readers to nani. We will omit 
also the details about the encoding of rational numbers by E* and take directly 
the sequences of rational numbers as inputs and outputs to TTM’s. 

Definition 3. A real function / :C ffi ^ ffi is computable if there is a TTM M 
such that, for any x G dom(/) and any sequence of rational numbers 

which converges effectively to x, M ((u„)„gn) outputs a sequence (vn)neN of 
rational numbers which converges to f{x) effectively. 

Note that, in this definition we do not add any restriction on the domain of 
computable real function. Hence a computable real function can have any type 
of domain, because / |" A is always computable whenever / is computable and 
A C dom(/). Furthermore, for a total function / : [0, 1] — > K, / is computable 
iff / is sequentially computable and effectively uniformly continuous (see 101). 

Definition 4. For any subset C C E, 

1. C is closed under computable operators, iff f{x) G C for any x G C and 
any total computable real function / : E — > E. 

2. C is closed under partial computable operators, iff f{x) G C, for any x G C 
and any partial computable real function / :C E — > E with x G dom(/). 

Following proposition follows immediately from the definition. Remember 
that, A C N is ^2 iff A is Turing reducible to the halting problem 0'. 

Proposition 1. 1. xa G Cg A is rcursive. 

2. Xa G Cra -<=^ A is a A^-set, or equivalently, A <t %' ■ 

3. Ce and Cra are closed under arithmetical operations , x and -G, hence 

they are algebraic fields. 
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4-- Ce are closed under limits and computable real functions. 

5. Cic and C^c are closed under addition. 

Some other non-trivial closure properties are shown in PHI and Pi- 

Theorem 1 (Weihrauch and Zheng). 1. C^c is not closed under addition, 
i.e. there are left computable y and right computable z such that y + zis neither 
left nor right computable. 

2. Cwc is closed under arithmetical operations. In fact C^c is just the closure 
of Csc under the arithmetical operations. 

It is not very surprising that the classes C/c and C^c are not closed under 
“subtraction” and, in general, under computable real functions, because they are 
not symmetrical. On the other hand, the class Cwc is symmetrical and closed un- 
der arithmetical operations. So it is quite natural to ask whether it is also closed 
under limits and computable real functions. In the following we will give the neg- 
ative answers to both questions. To this end we need the following observations 
about weakly computable real numbers. 

Theorem 2 (Ambos-Spies, pjj). 1. If A, B CN are incomparable under Tur- 
ing reduction, then is not semi- computable. 

2. For any set A C N, if X 2 A is weakly computable, then A is f-r.e. for 
f{n) := 2^", hence A is tu-r.e. 

Theorem 3 (Zheng |2n|). There is a non-uj-r.e. A^-set A such that xa is 
weakly computable. 

3 Closure Property under Limits 

In this section, we will discuss the closure properties of several classes of real 
numbers under limits. We first consider the classes of left and right computable 
real numbers. The following result is quite straightforward. 

Theorem 4. The classes of left and right computable real numbers are closed 
under limits, respectively. 

For semi-computable real numbers, the situation is different. 

Theorem 5. The class Csc is not closed under limits. 

Proof. Define, for any n, s € N, at first the following sets: 

A := {e G N : (/5e is total} 

A„ := {e G N : (Vx < n)(pe{x) }} 

An,s := {e G N : (Vx < n)ipe,s(x) }} 

Since A„_s C Ag^g+i, (xA„,,)n,seN is obviously a computable sequence of rational 
numbers such that, for and n G N, (xa„,,)sgn is nondecreasing and converges to 
XA„. That is, (xA„)nGN is a computable sequence of C/c, hence it is a computable 
sequence of Csc- But its limit xa is not semi-computable. In fact xa is even not 
r.a. by Proposition [fl since A is not a A^-set. □ 
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Note that in above proof, as a computable sequence of C/c, (a;A„)nGN is also 
a computable sequence of C^ic and C^a- Then the folowing corollary follows 
immediately. 

Corollary 1. The classes C^c cifid Cra oltc not closed under the limit. 

Now we discuss the closure property under the effective limits. We will show 
that the class of semi-computable real numbers is closed under effective limits 
and the class of weakly computable real numbers, hence also the class of r.a. real 
numbers, is not closed under effective limits. 

Theorem 6. The class Csc is closed under the effective limits. 

Proof. Let be a computable sequence of Cgc which satisfies the condition 

that Wn{\xn —Xn+i \ < and converges to x. We shall show that x G C^c. 

By DefinitionEl there is a computable sequence (rij)ijgfj of rational numbers 
such that, for any n G N, (rnj)neN is monotonic and converges to Xn- For any 
n, we can effectively determine whether Xn is left or right computable by com- 
paring, say, r„o and r„i. Therefore, the sequence can be split into two 

computable subsequences and (a;mi)iGN of left and right computable 

real numbers, respectively. At least one of them is infinite. Suppose w.l.o.g. 
that (a;rti)iGN is an infinite sequence. Obviously it is also a fast convergent com- 
putable sequence, i.e., — Xm+i \ < 2“*, since converges fast. Define 

a new sequence (?/„)„6 n by yt := - 2“b-i), Since yt+i = Xm+i - = 

{Xrii+i - Xm + 2“* -I- {Xm - 2“d-i)) > - 2“b-i) = (?/j)j6n is an increasing 

sequence. Let rb := nj — 2“b-i), Then (rb)ijgN is a computable sequence of 
rational numbers such that, for any i, is increasing and converges to 

yi. Namely, (j/i)iGN is an increasing computable sequence of C/c- By Theorem E] 
its limit limi^oo Vi = limi^oo x„. = limi^oo Xi = x is also left computable, i.e., 
X G Cic C Csc. n 

Theorem 7. The class C^c is not closed under effective limits. 

Proof. Suppose by Theorem 0 that A is a non-w-r.e. A^-set such that xa is 
weakly computable. Then X 2 A is not weakly computable by Theorem 0 Let 
(As)sgn be a recursive approximation of A such that (a;A,)sGN converges to xa 
weakly effectively, i.e. ~ xa^^A < C for some C G N. Define, for 

n, s G N, 



Bn^s := 2(As f (n -I- 1)) U (Ag t 2n) 

B„:=2(A r(n+l))U(A [2n) 

It is easy to see that (i?n.s)n,sGN is a computable sequence of finite subsets 
of N, hence J„_sgn is a computable sequence of rational numbers. 

Since lirng^oo As = A, there is an Nfn), for any n G N such that, for any 
s > N{n), As f (n -I- 1) = A [■ (n -I- 1). Let Ci = \xBr^,s ~ Xb^^^+A- Then 
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Er=o I = Efio ^ I + ET>mu) l^B^.s - I 

= Cl + J2^N(n) l^^n.s ~ < Cl + C. On the other hand, it is easy to 

see that linis^oo = xb^- Therefore, the sequence {xB„^,)n,seN converges 
to xb„ weakly effectively. Hence (a:B^)„gN is a weakly computable sequence of 
real numbers. By the definition of H„, Bn A2A C {2n+ l,2n + 2, . . .}. It follows 
that \xb^ — X 2 a\ < 2“^" < 2“" This means that (a;B„)„gN converges to X 2 A 
effectively and this ends the proof of the theorem. □ 



Theorem 8. The class Cra is closed under effective limits. 

Proof. Let (a;„)„gN be any computable sequence of Cra which converges ef- 
fectively to X. Assume w.l.o.g. that it satisfies, for all n G N, the condition 
\xn — Xn+i\ < By Definition 121 there is a computable sequence (rij)ij^fq 

of rational numbers such that, for any n G N, lims_>oo rns = Xn. We shall show 
that X G Cra- 

It suffices to construct a computable sequence (ws)sgN of rational numbers 
such that lirng^oo Us = x. This sequence will be constructed from in 

following stages: 

The construction of sequence (us) sgn^ 

Stage s = 0: Define uq := rgo, t(0, 0) := 0 and i(0) := 0. 

Stage s -I- 1: Given i(s), uq, . . . , Uj(s) and t{j, s) for all j < s. If there is j < s 
satisfying |uqj_i g)— r^sl < such that either t{j, s) —1 & \ut{i.s)~Xjs\ > 

2-0+1) Qj. g) ^ _X ,§2 g) = choose jo as minimal such j and 

define 

'i(s-l-I) := i(s) -I- 1 

lli(s-l-l) •= Xjo^s 

(*) < ( t{j, s) if 0 < j < jo 

t{j, s -k 1) := < i(s) -k 1 if j = jo 

[ -1 if jo <j <s + I. 

Otherwise, if no such j exists, then define, i{s + I) := i(s), t{s -k 1, s -k 1) := — I 
and t{j, s -k I) := t{j, s) for all j < s. 

To show this construction succeeds, we need only to prove the following 
claims. 

1. For any j G N, the limit t{j) := lirng^oo t{j, s) exists and satisfies the condi- 
tions that t{j) — I and |u(Q) — Xj \ < 

2. lims^oo i{s) = -koo. 

3. For any j G N, (Vs < t{j)){\u^j^ - Ug| < 2~^). 

Now it is clear that the sequence (us)sgn constructed above is a computable 
infinite sequence of rational numbers. Furthermore, this sequence converges to 
X. This completes the proof of Theorem. □ 
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4 Closure Property under Computable Operators 

In this section we will discuss the closure property under computable operators. 
The following result about left and right computable real numbers is immediate 
by the fact that the real function / defined by f{x) = —x is computable. 

Proposition 2. The classes Cic and Crc not closed under the computable 
operators, hence is also not closed under partial computable operators. 

To discuss the closure property under partial computable operators for other 
classes, we will apply the following observation of Ko [Zj. 

Theorem 9 (Ker-I Ko |7|)- For any sets A,BCN, A <t B iff there is a 
(partial) computable real function / :C K ^ K such that f{xB) = xa- 

^From this result, it is easy to show that a lot of classes of real numbers are 
not closed under the partial computable operators. 

Theorem 10. The classes Cgc o,nd Cwc cltc not closed under the partial com- 
putable operators. The class Cra is closed under partial computable operators. 

Proof. 1. For class Cgc- By Muchnik-Friedberg Theorem (see El). there are 
two r.e. sets A and B such that they are incomparable under Turing reduction. 
Then x^^^-g is not semi-computable by Theorem 0 On the other hand, xa®b 
is left computable since A (B B is r.e. Obviously, we have the reduction that 
A® B <T A® B. By Theorem 0 there is a computable real function / such 
that f{xA(BB) = Therefore, Csc is not closed under partial computable 

operators. 

2. For class C^c- By Theorem 01 there is a non-w-r.e. set A such that xa 
is weakly computable. On the other hand, X 2 A is not weakly computable by 
Theorem 0 since 2A is obviously not w-r.e. Because 2A <t A, by Theorem El 
there is a computable real function / such that /{xa) = X 2 A- That is, Cwc is 
not closed under the partial computable operators. 

3. For class Cra, it follows immediately from the fact that a real number xa 

is r.a iff 4 is a Zl^-set and the class of all Zl^-sets is closed under the Turing 
reduction, i.e. if A <t B and B is Zl^-set, then A is also Zl^-set. □ 

It is shown in Theorem 0 that the class Csc is not closed under addition. 
Hence it is not closed under the polynomial functions with several arguments. 
Namely, if p{x, ■ ■ ■ ,Xn), n > 2, is a polynomial with rational coefficients and 
ai, • • • , a„ are semi-computable real numbers, then p{a\, • • • , a„) is not necessary 
semi-computable. But for the case of n = 1, it is not clear. Furthermore it is also 
not known whether the class of semi-computable real numbers is closed under 
the total computable real functions. 

Similarly, it remains still open whether the class Cwc is closed under (total) 
computable operators. We guess it is not. One possible approach is to define a 
computable real function which maps some weakly computable xa for a non-w- 
r.e. set H to a not weakly computable real number X 2 A- Using the idea in the 
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proof of Theorem El it is not difficult to show that there is a computable real 
function f :C W W such that f{xA) = X 2 A for any irrational xa- Unfortu- 
nately, such function cannot be extended to a total computable real function as 
shown by next result. 

Theorem 11. 1. Let / :C N — > N &e a function such that f{xA) = X 2 A for any 

irrational xa- If xa is a rational number, then there is a sequence of 

irrational numbers such that lim„^oo Xn = X 2 A ond lim^^oo f{x„) = X 2 A- 
2. The function f : [0; 1] ^ K defined by f{xA) '■= X 2 A for any C N zs not 
continuous at any rational points, hence it is not computable. 

Proof. 1 . Suppose that function / :C N — > N satisfies f{xA) = X2A for any 
irrational xa- Let xa be rational, hence T is a finite set. We define a sequence 
of irrational numbers by Xn ■= xa + V^'‘ 2 .~^"‘~^^'^ . Let no be the maximal 
element of A. Define a set An by xa„ = for any n S N. Then for 

any n > no, xa„ < 2 “” < 2 “"“. This implies that contains only the elements 
which are bigger than no- Therefore, for any n > no, AC\ An = 0 and f{xn) = 
f{xA + • 2 = f{xA + Xa„) = fixAUAr,) = a^2(ylUA„) = 3;(2A)U(2A„) = 

X2A + X2An- Since lim„^oo xa„ = 0, it is easy to see that lim„^oo 2^2A„ = 0 too. 
So we conclude that lim„^oo f{xn) = X2A- 

2. Suppose that / : [0; 2] ^ ffi satisfies f{xA) = X 2 A for any A C N. For any 
rational xa, A is finite. Let no be the maximal element of A and A! := A\{no} 
and define, for all n G N, a finite set An by := A'U{no + l, no + 2, - - ■ , no + n}. 
Then it is easy to see that lim„_>oo xa„ = xa- On the other hand we have: 

n 

f{xAj = X2An = X2A' + ^ 2“^Oo+d 

= X2A - 2"^”“ -f 2"2"0 . (1 _ 2-2")/3 

This implies that lim„^oo f{xAn) = X 2 A — 2“^”“+^/3 yf X 2 A- □ 

In summary, the closure properties of several classes of real numbers under 
arithmetic operations (-I-, — , x and -7), limits, effective limits, partial com- 
putable operators and computable operators are listed in the following table: 





arithmetic 

operations 


limits 


effective 

limits 


computable 

operators 


partial computable 
operators 


Ce 


Yes 


Yes 


Yes 


Yes 


Yes 


Cic 


No 


Yes 


Yes 


No 


No 


C'rc 


No 


Yes 


Yes 


No 


No 


C.c 


No 


No 


Yes 


7 


No 


^wc 


Yes 


No 


No 


7 


No 


^ra 


Yes 


No 


Yes 


Yes 


Yes 
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Abstract. The complete list of forbidden minors (obstructions) for the 
family of graphs with vertex cover 6 have been found. This paper shows 
how one can limit the search space of graphs and how to simplify the pro- 
cess for deciding whether a graph is an obstruction for fe-VERTEX Cover. 
The upper bounds 2fc -|- 1 {2k -|- 2) on the maximum number of vertices 
for connected (disconnected) obstructions are shown to be sharp for all 
fc > 0. 



1 Introduction 

The main contribution of this paper is the characterization of graphs with vertex 
cover at most 6 by its obstruction set (forbidden minors). The general problem 
of vertex cover (which is AfP-complete; see ftl,l79| ) asks whether a graph has a 
set of vertices of size at most k that covers all edges (a more formal definition 
is given below). Earlier Cattell and Dinneen in [t 1D94| classified the families of 
graphs with vertex cover at most 5 by using the computational machinery now 
described in ICDD+nni . Our current results are based on a more family-specific 
approach where we limit the search space of graphs. In this paper, as our primary 
limiting factor, we prove an exact upper bound on the number of vertices for an 
obstruction to any fc- Vertex Cover family. 

The numbers of obstructions for 1-Vertex Cover to 5-Vertex Cover, 
along with our new result for 6-Vertex Cover, are listed below in Tabled 

We had known that the set of obstructions for 6-Vertex Cover is finite 
by the now-famous Graph Minor Theorem (GMT) of Robertson and Seymour 
They proved Wagner’s conjecture which states that there are a finite 
number of obstructions for any graph family closed under the minor order. Un- 
fortunately the proof of the GMT does not indicate how to find these obstruc- 
tions. The set of planar graphs are the best known example of a family with 
“forbidden graphs”, where Kuratowski’s characterization provides us with 
and A" 3,3 as the only obstructions to planarity. A lot of work has recently been 
done concerning the development of general methods for computing minor-order 
obstructions, such as mentioned in the papers fuLMr^RTMlLlmi rpTn^. 

In this paper we use standard graph theory definitions. A graph is a pair 
{V,E), where U is a finite set of vertices and if is a set of undirected edges 

D.-Z. Du et al. (Eds.): COCOON 2000, LNCS 1858, pp. ISO- TTOTl 2000. 

(c) Springer- Verlag Berlin Heidelberg 2000 
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Table 1. Numbers of obstructions for A:-Vertex Cover, 1 < A: < 6. 



k 


Connected 

obstructions 


Disconnected 

obstructions 


Total 

obstructions 


1 


1 


1 


2 


2 


2 


2 


4 


3 


3 


5 


8 


4 


8 


10 


18 


5 


31 


25 


56 


6 


188 


72 


260 



connecting two vertices of V. An edge between vertices x and y of V will be 
denoted by xy. 

A graph H is a, minor of a graph G, denoted H <m G, if a graph isomorphic 
to H can be obtained from G by a sequence of a operations chosen from: (1) 
delete a vertex, (2) delete an edge, or (3) contract an edge (removing any multiple 
edges or loops that form). The minor order is the set of finite graphs ordered 
by <m and is easily seen to be a partial order. 

A family iF of graphs is a (minor- order) lower ideal, if whenever a graph 
G G iF implies that H £ fF for any H <m G. An obstruction (often called 
forbidden minor) O for a lower ideal T is & minor-order minimal graph not in T . 
Thus, for example, and are the ‘smallest’ non-planar graphs (under the 
minor order). Recall that by the GMT, a complete set of obstructions provides 
a finite characterization for any (minor-order) lower ideal T . 

The graph families of interest in this paper are based on the following problem 



Problem. Vertex Cover 

Input: Graph G = (V,E) and a positive integer k < \V\. 

Question: Is there a subset V' C V with \V'\ < k such that V contains at least 
one vertex from every edge in El 

A set V in the above problem is called a vertex cover for the graph G. The 
family of graphs that have a vertex cover of size at most k will be denoted by 
A:-Vertex Cover, which is easily seen to be a lower ideal (see ISM]). For a 
given graph G, let VC(G) denote the least k such that G has vertex cover of 
cardinality k. Figure ^shows an example of a graph G with VC(G) = 4. 

We now describe the organization of our paper. In the next section we first 
explain our computational model and give some general results relating to how to 
compute obstructions for any /c-Vertex Cover lower ideal. We then state some 
specific results regarding 6-Vertex Cover, namely vertex and edge bounds. 
After a short conclusion and references, we compactly list all of the connected 
obstructions for 6- Vertex Cover in Figures EJEl Complete proofs for some of 
the results and larger drawings of the obstructions may be found in [DXQQj . 
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Fig. 1. An example of a graph of vertex cover 4. The black vertices denote one 
possible vertex cover. 



2 Computing Minor-Order Obstructions 

Our basic computational method is simply to generate and check all graphs 
that are potential obstructions. In practice, this search method can be used 
for an arbitrary lower ideal if one can (1) bound the search space of graphs 
to a reasonable size and (2) easily decide whether an arbitrary graph is an 
obstruction. With respect to /c-Vertex Cover, we can do both of these tasks 
efficiently. 

In earlier work, Cattell and Dinneen in |HD94| bounded the search space to 
graphs of pathwidth at most k + 1 when they computed the A:- Vertex Cover 
obstructions for 1 < fc < 5. Their generation process was self-terminating but, 
unfortunately, many isomorphic graphs were created during the generation of 
the search space. These superfluous graphs had to be either caught by an iso- 
morphism checking program or eliminated by other means. For k = 6 using this 
approach did not seem feasible. For 6-Vertex Cover, we utilized McKay’s 
graph generation program geng (part of his Gtools/Nauty package I.VIcKyOl 1 
and obtained some tight new upper bounds on the structure of vertex cover 
obstructions. 

2.1 Directly Checking Non-isomorphic Graphs 

We now discuss a natural method for finding a set of obstructions for a lower 
ideal from an available set of non-isomorphic graphs. We need to generate a 
complete set of non-isomorphic (connected) graphs, which is large enough to 
contain the set of obstructions sought. 

To decide if a graph is an obstruction it helps to have an efficient membership 
algorithm GA for the targeted lower ideal IF. For the 6-Vertex Cover family 
of graphs we used an implementation of the Balasubramanian et. al. linear-time 
vertex cover algorithm piFT},98| . By using GA for IF, an algorithm to decide if 
a graph is an obstruction is almost trivial. If a given graph G is an obstruction 
for iF, then G is a minimal graph such that G ^ J- . For a lower ideal T we only 
need to check that each ‘one-step’ minor of G is in IF. A general algorithm to 
decide if G is an obstruction for T is presented below. 
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Procedure IsObstruction(GraphMembershipAlgorithm GA, Graph G) 
if GA{G) = true then return false 
for each edge e in G do 

G' = the resulting graph after deleting e in G 
if GA(G') = false then return false 
+ G" = the resulting graph after contracting e in G 

+ if GA{G”) = false then return false 

endfor 
return true 

end 

The above algorithm IsObstruction will work for any minor-order lower 
ideal. For a given input finite graph G, if the graph family membership algo- 
rithm GA operates in polynomial time, then this algorithm IsObstruction has 
polynomial-time complexity. However, for some particular lower ideals, the al- 
gorithm can be simplified (e.g., remove lines marked with -|-’s). 

Lemma 1. A graph G = {V, E) is an obstruction for fc-VERTEX Cover if and 
only if VC{G) = k + 1 and VC{G\uv) = k for all uv € E. 

Proof. Let the graph G be an obstruction for ^-Vertex Cover. This implies 
that if any edge in G is deleted then the vertex cover decreases, by the definition 
of an obstruction. The size of the vertex cover decreases by one to exactly k, 
otherwise VG{G) < k+ 1. 

Now we prove the other direction. Let G = (V, E) be a graph, and T denote 
a fixed fc-VERTEX Cover lower ideal such that G ^ T . Suppose if any edge in 
G is deleted, then the resulting graph G' S T . Thus for each edge uv in G, a set 
of vertices V of cardinality k can be found which covers all edges in G except 
edge uv. 

Let u be the reserved vertex and v be the deleted vertex after uv is contracted. 
Since V covers all edges in G except uv, V covers each edge wv where w ^ u. We 
have w GV'. After contraction of uv, for each edge wv, a new edge wu is made 
in G'. Since w G V , V covers wu. Thus all new edges are covered by V . Hence 
after doing any edge contraction for any edge uv in G, a vertex cover V' where 
\V'\ = k covers all edges in the resulting graph G'. That is G' G T . Therefore if 
each edge deletion causes G' G E and G ^ F , then G is an obstruction. □ 

Thus according to the above lemma, for fc- Vertex Cover, our obstruction 
deciding algorithm is simplified. This means it does not have to check edge 
contraction minors (i.e., create the graphs G" in the IsObstruction procedure). 
This greatly reduces the overall computation time for deciding if a graph is 
an obstruction since doing any one edge contraction, with most graph data 
structures, is not relatively efficient (as compared with the actual time needed 
to delete any one edge). 
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2.2 Properties of Connected fc~VERTEX Cover Obstructions 

The number of connected non-isomorphic graphs of order n increases exponen- 
tially. For n = 11, the number of connected graphs is 1006700565. If we could 
process one graph per microsecond we would still need over 11 days of running 
time. Since there exist obstructions larger than this for 6- Vertex Cover we 
clearly need a more restricted search space. Thus we will filter the input by 
exploiting other properties of the /c- V ertex Cover obstructions. 

A few important properties for the ^-Vertex Cover obstructions are now 
systematically presented. 

Lemma 2. Any connected obstruction for the fc-VERTEX Cover lower ideal is 
a biconnected graph. 

Proof. Let u be a cut- vertex in a connected obstruction O for A:-Vertex Cover. 
Also let Cl, C 2 , . . . , Cm >2 be the connected components of 0 \{t} and C' = 
0[V{Ci) U {v}], where 0[X] denotes the subgraph induced by vertices X. Each 
C' denotes the part of the graph containing the component Ci, the vertex v, and 
the edges between v and Ci. Since O is an obstruction to A:- Vertex Cover. We 
have yC(Ci) = k. Any vertex cover for IJ™ ^ ^ vertex cover for O, 

where vertex v may be repeated in several C'. Thus, ^C'(C') > VC{0) = 
k + 1. This implies that there exists an i such that VC(C') = VC{Ci) + 1. Now 
O' = Cj) UC- is a proper subgraph of O. But VC(0') = k+1 contradicts 
O being an obstruction. So O does not have any cut-vertices. Therefore, O is a 
biconnected graph. □ 

A nice filter for our graph generator is the following result. 

Lemma 3. A vertex in an obstruction for /c-Vertex Cover has degree at most 
k + 1. 

Proof. Suppose u is a vertex with degree at least A: -|- 2 in an obstruction O = 
{V,E) for A:- Vertex Cover. Let O' be the resulting graph by deleting any edge 
uv of O. Since O is an obstruction for A:- Vertex Cover, we have VC (O') = k. 
Hence, in G there is a set of vertices V CV which covers all edges in G except 
uv and \V'\ = k. Since V does not cover edge uv, u and v . 

Thus V must contain all the neighbors of u except v. Hence V contains at 
least A; -I- 2 — 1 vertices in the neighborhood of u. Thus we have | V'| > A: -|- 1 > A:, 
contradicting V being a witness vertex cover to O' . 

Hence the degree of u is at most A; -|- 1. Therefore a vertex in an obstruction 
for A;- Vertex Cover has maximum degree A: -I- 1. □ 

Using the above result, along with Lemma^ we can easily derive the follow- 
ing. 

Lemma 4. There is only one connected obstruction for A:-Vertex Cover with 
k + 2 vertices, which is the complete graph Kk+ 2 - Furthermore, no other obstruc- 
tion for A:-Vertex Cover has fewer vertices. 
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The previous lemma shows that K ^+2 is the smallest connected obstruction 
for A:- Vertex Cover. We can also show that this is the unique obstruction 
with maximum degree. 

Lemma 5. If a vertex in a connected obstruction for /c-Vertex Cover has 
degree k + 1, then this obstruction is the complete graph with k + 2 vertices 
(Kk+2). 

Proof. Suppose O = {V,E) is a connected obstruction for A:-Vertex Cover 
and rt is a vertex in O with degree k + 1 and let N (u) be the neighborhood of 
u. Thus |fV(u)| = fc + 1. 

Let uv be any edge incident to u. Let O' = 0\uv; we know VC{0') = k. 
This is illustrated in Figure El Let V' be a minimum vertex cover of O' . We have 
u and V (otherwise O' is not an obstruction). 




Fig. 2. A vertex u in O with degree fc + 1. The black vertices must be in the 
vertex cover whenever uv is deleted. 

Since V covers all edges incident to u except uv, we have V = N{u)\{v}. 
Since N{u)\{v} is the vertex cover for O' , v is not adjacent to any other vertex 
w where w ^ N{u), otherwise, w must also be in V . Thus N{v) C N{u) U {u}. 

Since v is any vertex in N(u), any vertex in N(u) is connected to u or vertices 
in N{u). Thus {«} U N{u) = V. Hence O has k + 2 vertices. By Lemma El a 
connected obstruction with k + 2 vertices for A;-Vertex Cover is the complete 
graph Kk+ 2 , thus O = Kk +2 in this case. □ 

This lemma shows that for any obstruction O = {V, E) for fc-VERTEX Cover, 
if A is the maximum degree of O and \V\ > k + 2 then A < k + 1. 

2.3 Vertex Bounds for fe-VERTEX Cover 

We now present two important results that yield sharp upper bounds on the 
number of vertices for connected and disconnected /c-Vertex Cover obstruc- 
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tions. Later we will give some edge bounds that may be generalized (with some 
effort) to fc- Vertex Cover. 

Theorem 1. A connected obstruction for /c-Vertex Cover has at most 2k + l 
vertices. 

Proof. Assume O = {V,E) is a connected obstruction for fc- Vertex Cover, 
where \V\ = 2k + 2. We prove O is not a connected obstruction for /c-Vertex 
Cover by contradictions. The same argument also holds for graphs with more 
vertices. 

If O is a connected obstruction of A:-Vertex Cover, then VC{0) = k + \. 
Hence V can be split into two subsets Vi and V2 such that Vi is a fc + 1 vertex 
cover and V 2 = V\Vi. Thus IV2I = 2fc + 2 — (fc + 1) = fc + 1. Obviously, no 
edge exists between any pair of vertices in V2, otherwise Vi is not a vertex cover. 
Each vertex in Vi has at least one vertex in V2 as a neighbor, otherwise it can 
be moved from Vi to V2- (be., the vertex is not needed in this minimal vertex 
cover) . 

Thus the neighborhood of V2, NiV^) is Vi. We now prove that no subset S 
of V2 bas |IV(<S')| < IS”!. [This result, in fact, will immediately exclude the case 
|V| > 2fc + 2.] 

By way of contradiction, assume V3 = S' is a minimal subset in V2 such that 
l-^(ib3)l < 1^3 1- We say V3 is minimal whenever if T is any subset in V3 then 
1-^(2^) I > \T\- Note V3 will contain at least 3 vertices since every vertex has 
degree at least 2 in biconnected graphs (see Lemma EJ. 

Let 14 = V(Va). Thus all edges adjacent to V3 are covered by V4. Thus 
V 4 CVi and [141 < jVaj. Let I4 = V2\V3 and Vq = Vi\V4. The graph O can be 
split as indicated in Figure 0 




Fig. 3. Splitting the vertex set of O into four subsets. 
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We now prove that the existence of such a set V3 forces O to be disconnected. 
Since V3 is assumed minimal, no subset T in V3 has |fV(r)| < \T\. Thus if we 
delete any vertex in V3, leaving Vg, then any subset T in has |A^(T)| > |T|. 

Recall that a matching in a bipartite graph is a set of independent edges 
(with no common end points). Recall Hall’s Marriage Theorem IIHaKlhl : 

Hall’s Marriage Theorem: 

A bipartite graph B = {X\,X2tE) has a matching of cardinality \Xi\ if and only 
if for each subset A C Xi, |7V(A)| > |A|. 

Thus there is a matching of cardinality \Vf\ in the bipartite induced subgraph 
O' = {Vf, N{V^)) in O. Thus there are |Vg'| = IV3I — 1 independent edges between 
the set V3 and the set V4. 

Let us now delete all edges between V4 and V5 and all edges between V4 and 
Vfj. Let Cl = 0[Vi3 U V4], C2 = 0[Vs U Vq] be these disconnected components 
in the resulting graph. (The induced graph Ci is connected, while the induced 
graph C2 may or may not be.) 

As discussed above, there exists IV3I — 1 independent edges in Ci. Thus to 
cover these edges, VC{C\) > IV3I — 1. We know IV4I < IV3I and all edges incident 
to V3 are covered by V4, thus VC(Ci) = IV4I. 

Now consider the graph C2. Suppose, VC(C2) < IVe]. Since all deleted edges 
are also covered by V4, V4 U VC(C2) must cover all edges in O. Thus VC( 0 ) = 
IV4I + VC(C2) <|14| + |V6| = fc+l. This contradicts that O is an obstruction 
for fc-VERTEX Cover. Thus the assumption that VC(C2) < |V(j| is not correct. 
Hence VC(C2) = iVe] even though edges between Ci and C2 were deleted. 

Thus RC(Ci UC2) = iVlil + lVel = fc+1. Therefore O can not be a connected 
obstruction for A:-Vertex Cover since the resulting graph still requires a /c+ 1 
vertex cover whenever all edges between V4 and V5 and all edges between V4 and 
Vf3 are deleted. 

Therefore the assumption that there exists a minimal subset V3 in V2 is not 
correct. Hence any subset S in V2 has |iV(<S')| > [S']. 

Once again, by applying Hall’s Marriage Theorem, there is a matching of 
cardinality /c + 1 in the induced bipartite subgraph O' = (V2,Vi) of O. To cover 
these fc + 1 independent edges, a vertex cover of size k + 1 is necessary. We 
know that if O is a connected graph, there must exist other edges in O except 
these fc + 1 independent edges. If those edges are deleted, the resulting graph 
still has vertex cover fc + 1. Thus O can not be a connected obstruction for 
fc-VERTEX Cover if it has more than 2fc + 1 vertices. □ 

By extending the above result, we can prove the following corollary by in- 
duction. 

Corollary 1 . Any obstruction for fc-VERTEX Cover has order at most 2k + 2. 

We now mention some observations about the fc-VERTEX Cover obstruc- 
tions. The known obstructions for the small cases (see |™4| for fc < 5, in 
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addition to this paper’s fc = 6) indicate a very interesting feature: the more ver- 
tices, the fewer edges. As proven above, a complete graph ATfe +2 is the smallest 
obstruction (and only one), but it seems to have the largest number of edges. The 
cycle obstruction C 2 fc+i appears to be the only connected obstruction with the 
largest number of vertices (and appears to have the smallest number of edges). 

2.4 Computing All Obstructions for 6-Vertex Cover 

We divide our computational task for finding all the connected obstructions for 
6- Vertex Cover into two steps. 

(1) We find all connected obstructions with order at most 11 for 6-Vertex Cov- 
er. This step is very straight forward. Our algorithm IsObstruction is applied 
to all non-isomorphic biconnected graphs with a number of vertices between 9 
and 11, of maximum degree 6, and of maximum number of edges 33. 

(2) To find all connected obstructions of order 12 and 13 for 6- Vertex Cover, 
new degree and edge bounds were found and used, as indicated below. 

By the Lemma0 we know that if a connected obstruction O has a vertex of 
degree 7 for 6- Vertex Cover, then this obstruction must be ATg. Thus we only 
have to consider graphs with maximum degree 6. Furthermore, if the degree for 
O is 6, we can easily prove the following statement by cases. 

Statement 1. If a connected obstruction O = (V, E) for 6-Vertex Cover has 
a vertex of degree 6, then \V\ < 10. Furthermore, if |V| = 10 then \E\ < 24, and 
if |V| = 9 then \E\ < 25. 

A consequence of the proof for Statement 1 (see IDAOPj ! is the following: if 
a connected obstruction O for 6-Vertex Cover has 11 or more vertices then 
the degree of every vertex is less than or equal to 5. With another series of case 
studies of graphs with vertices of degree 5 we have the following. 

Statement 2. If a connected obstruction O = (V, E) for 6-Vertex Cover has 
12 vertices then \E\ < 24. Further, if O has 13 vertices then \E\ < 26. 

The proof of this statement (see pX0Q| 1 also gives a degree bound of 4 for 
any obstruction with 13 vertices. The search space for finding all obstructions 
with orders 12 and 13 for 6-Vertex Cover has been extremely reduced. 

3 Final Remarks 

Our obstruction checking procedure for ^-Vertex Cover (primarily based on 
Lemmata 01 and EJ was efficient enough to process all biconnected graphs of 
order at most 11 for k = 6. To find all of the obstructions with 12 vertices 
for 6-Vertex Cover, we only needed to test all non-isomorphic (biconnected) 
graphs with maximum degree 5 and at most 24 edges. For finding all obstructions 
with 13 vertices for 6-Vertex Cover, we only need to check all non-isomorphic 
graphs with maximum degree 4 and at most 26 edges. This search space is very 
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manageable; it requires about two months of computation time! In Figures EHSI 
we display all 188 connected obstructions for 6-Vertex Cover. 

We now mention a couple of areas left open by our research. It would be 
nice to have a proof that C 2 k+i is the only connected obstruction for /c-Vertex 
Cover, since we now known that 2fc + 1 is an upper bound on the number 
of vertices. There are some interesting open questions regarding edge bounds 
for A:-Vertex Cover. As we pointed out earlier, the obstructions start having 
fewer edges as the number of vertices increases. More theoretical results that 
generalizes our specific bounds for k = Q seem possible. 
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Abstract. Considering the three parameters of a directed graph: order, 
diameter and maximum out-degree, there are three optimal problems 
that arise if we optimise in turn each one of the parameters while hold- 
ing the other two parameters fixed. These three problems are related 
but as far as we know not equivalent. One way to prove the equiva- 
lence of the three problems would be to prove that the optimal value 
of each parameter is monotonic in each of the other two parameters. 
It is known that maximum order is monotonic in both diameter and 
maximum out-degree and that minimum diameter is monotonic in max- 
imum out-degree. In general, it is not known whether the other three 
possible monotonicity implications hold. In this paper, we consider the 
problem of determining the smallest diameter K{n,d) of a digraph G 
given order n and maximum out-degree d. Using a new technique for 
construction of digraphs, we prove that K(n,d) is monotonic for all n 
such that <'n < d!^ + d!^~^ , thus solving an open problem posed in 

1988 by Miller and Fris. 



1 Introduction 

In communication network design, there are several factors which should be 
considered. For example, each processing element should be directly connected 
to a limited number of other processing elements in such a way that there always 
exists a connection route from one processing element to another. Furthermore, 
in order to minimize the communication delay between processing elements, the 
directed connection route must be as short as possible. 

A communication network can be modelled as a digraph, where each process- 
ing element is represented by a vertex and the directed connection between two 
processing elements is represented by an arc. The number of vertices (processing 
elements) is called the order of the digraph and the number of arcs (directed 
connections) from a vertex is called the out-degree of the vertex. The diameter 
is defined to be the largest of the shortest paths (directed connection routes) 
between any two vertices. 

In graph-theoretical terms, the problems in the communication network de- 
signs can be modelled as optimal digraph problems. The example described above 
corresponds to the so-called order/degree problem: Construct digraphs with the 

D.-Z. Du et al. (Eds.): COCOON 2000, LNCS 1858, pp. 19.8- r?rm 2000. 

(c) Springer- Verlag Berlin Heidelberg 2000 
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smallest possible diameter K(n^ d) for a given order n and maximum out-degree 
d. Before we discuss in more detail optimal digraph problems, we give more 
formal definitions of the terms in graph theory that will be used. 

A directed graph or digraph C? is a pair of sets (V,A) where y is a finite 
nonempty set of distinct elements called vertices; and A is a set of ordered pair 
(it, v) of distinct vertices u,v gV called arcs. 

The order n of a digraph G is the number of vertices in G, that is, n = \V\. 
An in-neighbour (respectively out-neighbour) of a vertex u in G is a vertex u 
(respectively w) such that (u,v) G A (respectively (v,w) G A). The set of all 
in-neighbour (respectively out-neighbour) of a vertex v is denoted by N~{v) 
(respectively N~^(v)). The in-degree (respectively out-degree) of a vertex v is the 
number of its in-neighbour (respectively out-neighbour). If the in-degree equals 
the out-degree (= d) for every vertex in G, then G is called a diregular digraph 
of degree d. 

A walk of length I in a digraph G is an alternating sequence voaiVia 2 ...aiVi 
of vertices and arcs in G such that = {vi-i,Vi) for each i, 1 < i < 1. A walk 
is closed if vq = vi. If all the vertices of a iiq — vi walk are distinct, then such a 
walk is called a path. A cycle is a closed path. 

The distance from vertex u to v, denoted by 6{u,v), is the length of the 
shortest path from vertex u to vertex v. Note that in general 6{u,v) is not 
necessarily equal to S(v,u). The diameter of digraph G is the longest distance 
between any two vertices in G. 

Let t) be a vertex of a digraph G with maximum out-degree d and diameter 
k. Let Hi, for 0 < * < fc, be the number of vertices at distance i from v. Then 
n-i < d®, for 0 < z < A:, and so 

^ jfc+i _ 1 

n = m <1 -\- d-\- (f — - — (1) 

d — 1 

The right-hand side of m is called the Moore bound and denoted by Md^k- If 
the equality sign holds in dU then the digraph is called Moore digraph. It is well 
known that Moore digraphs exist only in the trivial cases when d = 1 (directed 
cycles of length k-\-l, Ck+i, for any fc > 1) or A: = 1 (complete digraphs of order 
d -I- 1, Kd+i, for any d > 1) [1 01,'Ij . Thus for d > 2 and k > 2, inequality (P) 
becomes 

jfe-i-i _ j 

n<d-kd^-k...-kd'"= — ; (2) 

d — 1 

Let Q {n, d, k) denotes the set of all digraphs G, not necessarily diregular, of 
order n, maximum out-degree d, and diameter k. Given these three parameters, 
there are three problems for digraphs G G G{n, d, k), namely, 

1. The degree/diameter problem: determine the largest order N(d,k) of a di- 
graph given maximum out-degree d and diameter k. 

2. The order/degree problem: determine the smallest diameter K(n,d) of a 
digraph given order n and maximum out-degree d. 
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3. The order/diameter problem: determine the smallest degree D(n,k) of a 
digraph given order n and diameter k. 

Miller pointed out the following implications regarding to the three prob- 
lems above. 

a. di < ^2 => N{di,k) < N{d 2 ,k); 

b. k\ < k2 ^ N{d,ki) < N{d,k2)\ 

c. di < d 2 => K{n,di) > K{n,d 2 ). 

It is not known whether the following three implications hold or not. 

1. fci < /c 2 -D(n, ki) > D{n, k 2 ), for {ki,k 2 < n — 1); 

2. m < ri2 K{m,d) < K{n2,d); 

3. ni < ri 2 D{ni, k) < D{n 2 , k), for (ni, ri 2 > fc -I- 1). 



Miller and Fris [Z| posed the problem of whether or not K{n, d) is monotonic 
on restricted intervals of n, for example, for all n such that < n < d^ + 

d^~^ . In this paper, we solve the open problem using a new technique for the 
construction of digraphs. 

2 Construction 

In this section we introduce a new technique for the construction of digraphs, as 
described in the following theorems. This construction will be used to prove the 
main results presented in the next section. 

Theorem 1 // G G Q{n,d,k) and N~^{u) = N~^{v) for any vertex u,v £ G, 
then there exists G\ G Q{n — l,d,k'), k' < k. 

Proof. Suppose that N~^{u) = N~^{v) for any vertex u,v G G. Let Gi be a 
digraph deduced from G by deleting vertex u together with its outgoing arcs 
and reconnecting the incoming arcs of u to the vertex v. Obviously, the new 
digraph G\ has maximum out-degree the same as the maximum out-degree of 
G. To prove that the diameter of Gi is at most k, we only need to consider 
the distance from vertices of N~{u) to other vertices in G reached through u. 
Just deleting u might cause a vertex in N~(u) not to reach some vertex within 
distance at most k. However, this is overcome when we reconnect the vertices of 
N~ (u) to V. Thus we obtain a digraph Gi with order n— 1, maximum out-degree 
d and diameter k' , where k' < k. □ 

Figure E^a) shows an example of digraph G G ^(6,2,2) with the property 
A^+(l) = fV+(5), fV+(2) = A^+(6), and fV+(3) = A^+(4). Applying Theorem 
[Oto the digraph by deleting vertex 6 together with its outgoing arcs and then 
reconnecting its incoming arcs to vertex 2 (since fV+(2) = A^+(6)), we obtain a 
new digraph Gi G ^(5,2,2) as shown in Figure [I](b). 
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(c) 



Fig. 1. The digraphs G G ^(6, 2, 2), Gi G G{5, 2, 2), and Gs G ^1(4, 2, 2) 



Digraphs which are obtained from line digraph construction contain some 
pairs of vertices with the same out-neighbourhoods. The line digraph L(G) of 
a digraph G is the digraph where each vertex of L{G) corresponds to arc of G; 
and a vertex uv of L{G) is adjacent to a vertex wx if and only if v = w, that is, 
if the arc (u,v) is adjacent to the arc (iv,x) in G. 

The order of the line digraph L(G) is equal to the number of arcs in the 
digraph G. For a diregular digraph G of out-degree d > 2, the sequence of line 
digraph iterations 

L(G),L^(G)=L(L(G)),...,r(G) = L(r~^(G)), ... 

is an infinite sequence of diregular digraphs of degree d. For more detail explana- 
tions about line digraph iterations, see P). The following facts about line digraph 
are obvious. 

Fact 1 If a digraph G is a diregular of degree d > 2, diameter k and order n 
then the line digraph L{G) is also diregular of degree d, order dn and diameter 
k + 1. 

Fact 2 If L{G) is a line digraph of diregular digraph G with degree d > 2, 
diameter k and order n then there are n tuples of d vertices in L{G) with iden- 
tical out-neighbourhood. In other words, if Ui G N~{v) and wj G N~^{v) where 
V, Ui, Wj G V{G) for every i = 1,2, ..., d and j = 1, 2, ..., d then 

N~^{uiv) = N~^{u2v) = ... = N~^{udV) = {VWI,VW2, ..., VWd} 

where UiV,vWj G V{LG). 

Applying Theorem Q to line digraphs, we obtain 

Theorem 2 If LG G Q{dn,d,k) is a line digraph of a diregular digraph G G 
Q{n, d,k — 1), then there exists digraph LGr G G(dn — r, d, k'), k' < k, for every 
f < r < {d — l)n — 1. 
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Proof. Suppose that LG is a line digraph of the digraph G. By Fact [U LG is 
also diregular digraph of degree d, order dn and diameter k. From Fact|2l we 
have n tuples of d vertices in LG with identical out-neighbourhood. One of the 
tuples is 

N~^{uiv) = N~^{u2v) = ... = N~^{udV) = {VWI,VW2, ..., VWd} 

where UiV, vwj G V(LG) for every i = 1,2, ..., d and j = 1, 2, ..., d. 

We apply Theorem ^ by deleting vertex uiv together with its outgoing arcs 
and reconnecting its incoming arcs to the vertex UiV for any i = 2,3, ...,d, we 
obtain the digraph with order one less than the order of LG. Repeating this 
notion can be done d — 1 times for each tuple. Since there are n tuples of d 
vertices with identical out-neighbourhood in LG, we can delete the vertices as 
many as (d — l)n — 1 vertices to obtain the digraph with diameter at most k. 
Thus the digraphs LGr G Q{dn — r, d, k'), k' < k, for every 1 < r < (d — l)n — 1 
do exist. □ 

Every time we delete a vertex u and reconnect its incoming arcs to other 
vertex v with the same out-neighbourhood as u leads to the number of incoming 
arcs of V increases. This implies that the resulting digraphs is not regular with 
respect to the in-degree, but still regular in out-degree. Therefore the resulting 
digraphs LGr are non-diregular. 

This situation becomes more interesting, when we consider Kautz digraphs, 
Ka G Q{d^ + df^~^,d,k) for d > 3 and k > 2. Since Kautz digraphs can be 
obtained by (fc — l)-fold iterations of the line digraph construction applied to 
complete digraph Kd+i, Theorem 0 immediately implies that 

Corollary 1 For d^~^ + d^“^ < n < d^ + d^~^ , there exist digraphs G G 
Gin, d, k'), k' <k. □ 

Figure ^a) shows an example of the Kautz digraph G G ^(6,2,2) which is 
a line digraph of and the non-diregular digraphs G\ G ^(5,2,2) and G 2 G 
C/(4, 2, 2) (Figure^b) and FigureOKc)) deduced from digraph G. In this example, 
to obtain G\ G ^(5, 2, 2) we delete the vertex 6 from G together with its outgoing 
arcs and reconnect its incoming arcs to the vertex 2. Furthermore, to obtain 
G 2 G C/(4, 2,2), we delete the vertex 5 from Gi together with its outgoing arcs 
and reconnect its incoming arc to the vertex 1. 

3 The Diameter 

Kautz digraphs Ka G G{d^ + d^“^, d, fc) are the largest known digraphs of max- 
imum out-degree d > 3 and diameter k >2. Kautz digraph Ka can be regarded 
as a line digraph L^~^{Kd+i) of the complete directed graph Kd+i. There are 
exactly d-l- 1 different out-neighbourhoods, each shared by d^'“^ vertices. There- 
fore Kautz digraphs can be used as base digraphs in our construction (Theorem 
01 to produce other digraphs with smaller order while keeping the diameter no 
larger than Ka and the maximum out-degree the same. Consequently, we have 
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Theorem 3 If < n < df + df ^ for d>2 and k >2, then K{n, d) = k. 
Proof. If the diameter of the digraph is k — 1, the inequality becomes 

N{d,k)<^^. (3) 

Obviously, if n > , then K{n,d) > k. By Corollary Q there exist digraphs 

of order n, for d^~^ + df~^ < n < d^ + d^~^ , out-degree d and diameter at 
most k. Since d^~^ + d^~‘^ < it follows that for digraphs with maximum 

out-degree d, order n, where < n < df + d^~^, the minimum diameter is 

k, i.e., K{n,d) = k. □ 

Theorem 0 can be restated as 

Theorem 3' K{n, d) is monotonic for all n such that < n < df + df~^ . □ 

The interval of order n can be tightened if we consider particular cases of out- 
degree d and if we utilise some additional known results, as described below. 

Let us first consider the case of maximum out-degree 3. Baskoro et al P 
proved that there is no digraph of maximum out-degree 3 and diameter k > S 
with order one less than the Moore bound. Therefore, for d = 3 and A: > 3, 
n < ^ ^2 diameter A: — 1, n < ^ 

Consequently, if n > ^ then K(n,S) > k. Combining this result and 
Corollary [Q we have 

Theorem 4 For every fc > 3 and ^ < n < 4 x 3^“^, K(n,3) = k. □ 

Another special case we can consider is digraphs of maximum out-degree 2. 
Alegre constructed diregular digraph of (constant) out-degree 2, diameter 4 and 
order 25, that is, Al G ^(25,2,4). This digraph is larger than Kautz digraph of 
the same maximum out-degree and diameter. Applying line digraph iterations 
and Theorem 13 to Al, we obtain 

Lemma 1 For every k > 5 and 25 x 2^“® < n < 25 x 2^“^, Q{n,d,k) is not 
empty. □ 

Furthermore, Miller and Sirah 0 proved that 

Theorem 5 If d = 2 and k > 2, then n < 2^+^ — 4. □ 

This result implies that for n > 2^ — 3, K{n, 2) > k. Combining this result and 
Lemma 0 we have 

Theorem 6 For every k > 5 and 2^ — 3 < n < 25 x 2^“"^, K{n, 2) = k. □ 

We conclude this paper with our current knowledge of the K(n,2) problem 
for n < 200 (Table [IJ. New results as displayed in bold-type, for example G G 
0(49, 2, 5) which is obtained by applying TheoremQto the line digraph of Alegre 
digraph (see Figure El . 
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n 


K{n, 2) 


n 


K[n,2) 


n 


K{n,2) 


n 


K{n,2) 


n 


K{n,2) 


1 


0 


41 


5 


81 


6 


121 


6 or 7 


161 


7 


2 


1 


42 


5 


82 


6 


122 


6 or 7 


162 


7 


3 


1 


43 


5 


83 


6 


123 


6 or 7 


163 


7 


4 


2 


44 


5 


84 


6 


124 


6 or 7 


164 


7 


5 


2 


45 


5 


85 


6 


125 


6 or 7 


165 


7 


6 


2 


46 


5 


86 


6 


126 


7 


166 


7 


7 


3 


47 


5 


87 


6 


127 


7 


167 


7 


8 


3 


48 


5 


88 


6 


128 


7 


168 


7 


9 


3 


49 


5 


89 


6 


129 


7 


169 


7 


10 


3 


50 


5 


90 


6 


130 


7 


170 


7 


11 


3 


51 


5 or 6 


91 


6 


131 


7 


171 


7 


12 


3 


52 


5 or 6 


92 


6 


132 


7 


172 


7 


13 


4 


53 


5 or 6 


93 


6 


133 


7 


173 


7 


14 


4 


54 


5 or 6 


94 


6 


134 


7 


174 


7 


15 


4 


55 


5 or 6 


95 


6 


135 


7 


175 


7 


16 


4 


56 


5 or 6 


96 


6 


136 


7 


176 


7 


17 


4 


57 


5 or 6 


97 


6 


137 


7 


177 


7 


18 


4 


58 


5 or 6 


98 


6 


138 


7 


178 


7 


19 


4 


59 


5 or 6 


99 


6 


139 


7 


179 


7 


20 


4 


60 


5 or 6 


100 


6 


140 


7 


180 


7 


21 


4 


61 


6 


101 


6 or 7 


141 


7 


181 


7 


22 


4 


62 


6 


102 


6 or 7 


142 


7 


182 


7 


23 


4 


63 


6 


103 


6 or 7 


143 


7 


183 


7 


24 


4 


64 


6 


104 


6 or 7 


144 


7 


184 


7 


25 


4 


65 


6 


105 


6 or 7 


145 


7 


185 


7 


26 


4 or 5 


66 


6 


106 


6 or 7 


146 


7 


186 


7 


27 


4 or 5 


67 


6 


107 


6 or 7 


147 


7 


187 


7 


28 


4 or 5 


68 


6 


108 


6 or 7 


148 


7 


188 


7 


29 


5 


69 


6 


109 


6 or 7 


149 


7 


189 


7 


30 


5 


70 


6 


110 


6 or 7 


150 


7 


190 


7 


31 


5 


71 


6 


111 


6 or 7 


151 


7 


191 


7 


32 


5 


72 


6 


112 


6 or 7 


152 


7 


192 


7 


33 


5 


73 


6 


113 


6 or 7 


153 


7 


193 


7 


34 


5 


74 


6 


114 


6 or 7 


154 


7 


194 


7 


35 


5 


75 


6 


115 


6 or 7 


155 


7 


195 


7 


36 


5 


76 


6 


116 


6 or 7 


156 


7 


196 


7 


37 


5 


77 


6 


117 


6 or 7 


157 


7 


197 


7 


38 


5 


78 


6 


118 


6 or 7 


158 


7 


198 


7 


39 


5 


79 


6 


119 


6 or 7 


159 


7 


199 


7 


40 


5 


80 


6 


120 


6 or 7 


160 


7 


200 


7 



Table 1. The values of K{n, d) for d = 2 and n < 200. 
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Abstract. At each step of the online independent set problem, we are 
given a vertex v and its edges to the previously given vertices. We are 
to decide whether or not to select n as a member of an independent 
set. Our goal is to maximize the size of the independent set. It is not 
difficult to see that no online algorithm can attain a performance ratio 
better than n — 1, where n denotes the total number of vertices. Given 
this extreme difficulty of the problem, we study here relaxations where 
the algorithm can hedge his bets by maintaining multiple alternative 
solutions simultaneously. 

We introduce two models. In the first, the algorithm can maintain a 
polynomial number of solutions (independent sets) and choose the largest 
one as the final solution. We show that is the best competitive 

ratio for this model. In the second more powerful model, the algorithm 
can copy intermediate solutions and grow the copied solutions in different 
ways. We obtain an upper bound of 0(n / (k log n)), and a lower bound of 
log® n), when the algorithm can make operations per vertex. 

1 Introduction 

An independent set (IS) in a graph is a set of vertices that are mutually non- 
adjacent. In the online independent set problem, a graph is given one vertex at 
a time along with edges to previous vertices. The algorithm is to maintain a 
proper solution, and must at each step decide irrevocably whether to keep the 
presented vertex as a part of the solution. Its goal is to find as large a set as 
possible, relative to the size of the largest independent set in the graph. 

Online computation has received considerable attention as a natural model 
of some properties of the real world such as the irreversibility of time and the 
unpredictability of the future. Aside from modeling real-time computation, it 
also has been found to be important in contexts where access to data is limited, 
e.g. because of unfamiliar terrain, space constraints, or other factors determining 
the order of the input. 

The online IS problem occurs naturally in resource scheduling. Requests for 
resources or sets of resources arrive online, and two requests can only be serviced 
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09480055 and 10205215. 
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simultaneously if they do not involve the same resource. When the objective is 
to maximize the throughput of a server, we have a (possibly weighted) online IS 
problem. Previously, this has primarily been studied in the context of scheduling 
intervals 0- 

The usual measure of the quality of an online algorithm A is its competitive 
ratio or performance ratio. For the online IS problem, it is defined to be p{n) = 
u\&yii(zi^opt{I)/A{I), where /„ is the set of all the sequences of vertices and 
corresponding edges given as n-vertex graphs, opt(I) is the cardinality of the 
maximum independent set of /, and A(I) is the size of the independent set 
found by A on /. 

One reason why the online IS problem has not received a great deal of atten- 
tion is that any algorithm is bound to have a poor competitive ratio. It is not 
hard to see that in the basic model, once the algorithm decides to add a vertex 
to its set, the adversary can ensure that no further vertices can be added, even 
though all the other n — 1 vertices form an independent set. Thus, the worst 
possible performance ratio of n — 1 holds for any algorithm, even when the input 
graph is a tree. As a result, we are interested in relaxations of the basic model 
and their effects on the possible performance ratios. 

In the coloring model studied recently in |3j, the online algorithm constructs 
a coloring online. The IS solution output will be the largest color class. This 
circumvents the strait-jacket that the single current solution imposes on the 
algorithm, allowing him to hedge his bets. This model relates nicely to online 
coloring that has been studied for decades. In particular, positive results known 
for some special classes of graphs carry over to this IS problem: O(logn) for 
trees and bipartite graphs j I I 2 t?^ . and for fc-colorable graphs |S|. Also, 

2-competitive ratio is immediate for line graphs, as well as ratio k for fc -|- 1- 
claw free graphs. On the other hand, for general graphs, no algorithm has a 
competitive ratio better than n/4 [3, a ratio that is achieved by the FirstFit 
coloring algorithm. 

Our results. We propose the multi-solutions model, where the algorithm can 
maintain a collection of independent sets. At each step, the current vertex can 
be added to up to r(n) different sets. The coloring model is equivalent to the 
special case r{n) = 1. We are particularly interested in polynomially bounded 
computation, thus focusing on the case when r{n) = n^, for some constant k. 
For this case, we derive an upper and a lower bound of 9{n/ logn) on the best 
possible competitive ratio. For r(n) constant, we can argue a stronger f2{n) lower 
bound. These results are given in Section 2. 

A still more powerful model proposed here is the inheritance model. Again 
each vertex can participate in limited number of sets. However, the way these 
sets are formed is different. At each step, r(n) different sets can be copied and 
the current vertex is added to one copy of each of the sets. Thus, adding a vertex 
to a set leaves the original set still intact. This model can be likened to forming 
a tree, or a forest, of solutions: at each step, r{n) branches can be added to trees 
in the forest, all labeled by the current vertex. Each path from a root to a node 
(leaf or internal) corresponds to one current solution. 
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The inheritance model is the most challenging one in terms of proving lower 
bounds. This derives from the fact that all the lower bounds used the other 
models, as well as for online graph coloring m, have a property called trans- 
parency: once the algorithm makes his choice, the adversary immediately reveals 
her classification of that vertex to the algorithm. This transparency property, 
however, trivializes the problem in the inheritance model. Still we are able to 
prove lower bounds in the same ballpark, or I7(n/(e^ log^ n)) when r(n) = . 

These results are given in Section 3. 

Notation. Throughout this paper, n denotes the total number of given ver- 
tices, and we write vertices v\, V 2 , • ■ ■ , Vn, where vertices are given in this order. 
Namely, at the ith step, the vertex Vi is given and an online algorithm decides 
whether it selects Vi or not. 

2 Multi-solutions Model 

In the original online IS problem, the algorithm can have only one bin, one 
candidate solution. In the multi-solutions model, the algorithm can maintain a 
collection of independent sets, choosing the largest one as the final solution. The 
model is parameterized by a function r(n); each vertex can be placed online 
into up to r(n) sets. When r(n) = 1, this corresponds to forming a coloring 
of the nodes. When r(n) = 2”, an online algorithm can maintain all possible 
independent sets and the problem becomes trivial. We are primarily interested 
in cases when r(n) is polynomially bounded. 

We find that the best possible performance ratio in this model is 9{n/ log r(n)). 
In particular, when r{n) is constant, the ratio is linear, while if r(n) is polyno- 
mial, the ratio is 6*(n/ logn). 

Remark. In online problems, the algorithm does not know the length n of the 
input sequence. Thus, in the multi-solutions model, the algorithm can assign the 
i-th. node to r(z) different sets, rather than r(n). However, it is well known that 
not informing of the value of n has very limited effect on the performance ratio, 
affecting at most the constant factor. Thus, in order to simplify the arguments, 
we may actually allow the algorithm r(n) assignments for every vertex. 

Theorem 1. There is an online algorithm with competitive ratio at most 
0(n/ log r(n)) in the multi-solutions model. 

Proof. Let t = [log r(n) — 1] . View the input as sequence of blocks, each with t 
vertices. For each block, form a bin corresponding to each non-empty subset of 
the t vertices. Greedily assign each vertex as possible to the bins corresponding 
to sets containing that vertex. This completes the specification of the algorithm. 

Each node is assigned to at most r(n) bins. Each independent set in each 
block will be represented in some bin. Thus, the algorithm finds an optimal 
solution within each block. It follows that the performance ratio is at most n/t, 
the number of blocks. □ 
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Lower Mounds We first give a linear lower bound for the case of r{n) being 
constant. 

Theorem 2. Any online IS algorithm has a competitive ratio at least 2 (r{n)+i) 
in the multi- solutions model. 

Proof. We describe the action of our adversary. Vertices have one of two states: 
good and bad. When a vertex is presented to an online algorithm, its state is 
yet undecided. Immediately after an online algorithm decides its action for that 
vertex, its state is determined. If a vertex v becomes good, no subsequent vertex 
has an edge to v. If v becomes bad, all subsequent vertices have an edge to v. 
Note that all good vertices constitute an independent set, in fact the optimal 
one. Also, note that once a bad vertex is added to a bin, no succeeding vertex 
can be put into that bin. 

The adversary determines the state of each vertex as follows: If an online 
algorithm put the vertex v into a bin with r(n) vertices or more, then v becomes 
a bad vertex; otherwise it’s good. Thus, no bin contains more than r(n) + 1 
vertex. This upper bounds the algorithm’s solution. 

Each bad node becomes the last node in a bin with r(n) copies of other good 
nodes. Since each node appears in at most r(n) bins, we have that the number 
of good nodes is at least the number of bad nodes, or at least n/2. This lower 
bounds the size of the optimal solution. □ 

Theorem 3. Any online IS algorithm has a competitive ratio at least 2 iog(ra".r(w)) 
in the multi- solutions model. 

Proof. In the lower bound argument here, we consider a randomized adversary. 
As before, we use good vertices and bad vertices. This time, our adversary makes 
each vertex good or bad with equal probability. We analyze the average perfor- 
mance of an arbitrary online algorithm. The probability that a fixed bin contains 
more than C = log(n^ • r(n)) vertices is at most (|)^ = „ 2 .r(„) since all but the 
last vertex must be good. The online algorithm can maintain at most n-r{n) bins. 
Thus, the probability that there is a bin with more than C vertices is at most 
nr{n) x Namely, with high probability the online algorithm outputs 

an independent set of size at most C. On the other hand, the expected value of 
the optimal cost, namely the expected number of good vertices, is | x n = ^. 
Hence, the competitive ratio is at least n/2(log(n^ • r(n))). □ 

Remark. Observe that the graphs constructed have a very specific structure. For 
one, they are split graphs, as the vertex set can be partitioned into an independent 
set (the good nodes) and a clique (the bad nodes). The graphs also belong to 
another subclass of chordal graphs: interval graphs. Vertex i can be seen to 
correspond to an interval on the real line with a starting point at i. If the 
node becomes a bad node, then the interval reaches far to the right with an 
endpoint at n. If it becomes a good node, then the right endpoint is set at 
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i + 1/2. It follows that the problem of scheduling intervals online for maximizing 
throughput is hard, unless some information about the intervals’ endpoints are 
given. This contrasts with the case of scheduling with respect to makespan, 
which corresponds to online coloring interval graphs, for which a 3-competitive 
algorithm is known H . 

Also, observe that the latter lower bound (as well as the construction in the 
following section) is oblivious (in that it does not depend on the actions of the 
algorithm), and hold equally well against randomized algorithms. 



3 Inheritance Model 

The model we shall introduce in this section is much more flexible than the 
previous one. At each step, an algorithm can copy up to r(n) bins and put the 
current vertex into those copies or a new empty bin. (We restrict the number 
of copies to r(n) — 1 when an online algorithm uses an empty bin, because it 
is natural to restrict the number of bins each vertex can be put into to r(n) 
considering the consistency with multi-solutions model.) Note that this model is 
at least as powerful as the multi-solutions model with the same r(n). 

The upper bound follows from Theorem 1. 

Corollary 4. There is an online algorithm with competitive ratio at most 
0(n/ log r(n)) in inheritance model. 

3.1 A Lower Bound 

As mentioned before, a lower bound argument is a challenging one. Before show- 
ing our result, we discuss some natural ideas and see why those arguments only 
give us poor bounds. 



Discussion 1 Consider the transparent adversary strategy of the proof of 
Theorem 3. There is an online algorithm that against this adversary always 
outputs an optimal solution for split graphs. The algorithm uses only two bins 
at each step. At the first step, it maintains the empty bin and the bin that 
contains v\. At the Ah step (i > 2), the algorithm knows that the vertex Vi-\ 
is good or bad by observing whether there is an edge {vi-\,Vi) or not. If Vi-\ 
is good, the algorithm copies the bin containing Ui_i, adds Vi to the copy, and 
forgets about the bin not containing Vi-\. Otherwise, if Vi-\ is bad, exchange 
the operations for the two bins. Observe that this algorithm outputs a solution 
consisting of all good vertices. 

What then is the difference between our new model and the old one? Note 
that, in the inheritance model, an algorithm can copy bins it currently holds. 
To simulate this operation in the multi-solutions model, an algorithm has to 
construct in advance two bins whose contents are the same. Since the above 
algorithm in the inheritance model copies at each step, an algorithm in the 
multi-solution model must prepare exponential number of bins in the worst case 
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to simulate it. Hence, for the current adversary, the inheritance model is expo- 
nentially more powerful than the multi-solutions model. Thus, to argue a lower 
bound in the inheritance model, we need a more powerful adversary. 

Discussion 2 In the discussion of the previous section, what supported the 
online algorithm in the new model is the information that the vertex is good 
or bad given immediately after the algorithm processed that vertex. Hence the 
algorithm knows whether it is good or bad at the next step. We now consider 
the possibility of postponing this decision of the adversary. 

Divide the vertex sequence into rounds. At the end of each round, the adver- 
sary determines the states of vertices it gave in that round. A vertex then acts 
as a good vertex while its state is undetermined. Until the end of the round, 
it is impossible for an online algorithm to make a decision depending on states 
of vertices given in that round. If, at the end of a round, an online algorithm 
maintains a bin that contains many vertices of that round, we “kill” that bin 
by making one of those vertices bad. This can be easily done by setting an 
appropriate probability that a vertex becomes bad. 

This approach still allows for an algorithm with a competitive ratio of ^/n. 
The algorithm maintains an independent set I containing one vertex from each 
round that contains a good vertex. It also puts all the vertices of each round i 
into an independent set A. It is easy to see that the largest of / and the Ifs is 
of size at least \/opt, when the optimal solution contains opt vertices. 

A Better Lower Bound The reason why the above argument did not give 
a satisfactory lower bound is that the adversary strategy was still weak: Once 
a decision is made, the algorithm knows that a good vertex will never become 
bad. Here, we consider an adversary that makes a probabilistic decision more 
than once (approximately log n times) to each vertex. 

Theorem 5. Any online IS algorithm has a competitive ratio at least „)3 

in the inheritance model with r{n) = where e is the base of the natural 
logarithm. 

Proof. We first describe informally the action of the adversary. He presents 
lo = log^ n new vertices in each round. Between rounds he makes a decision to 
make vertices bad, not only for vertices given in the previous round, but also for 
some other vertices already given. 

The goal of the adversary is to stunt the growth of bins containing many 
vertices by making some of them bad. At the same time, it needs to ensure that 
the fraction of vertices that are good in the end is large. The adversary repeats 
the following steps n/lo times: 

(1) The adversary presents a block of Iq new vertices. All those vertices 
are good and each of them has edges to all bad vertices previously given. 

(2) As long as there are two blocks bi, 62 of the same size, replace them 
by the union of the two. At the same time, each vertex of the combined 
block is changed into a bad vertex independently with probability 
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Let t = \og{n/lfj). Note that each block is of size 2 *^o, 0 < z < t, and the 
number of blocks is at most t + 1. 

Each time blocks are merged, the adversary is said to attack the respective 
vertices by probabilistically changing them to bad. The probability of a vertex 
surviving an attack is 1 — Each vertex receives at most t attacks. Hence, the 
probability that a vertex is good at the end of the game is at least (1 — > 

. The expected number of good vertices, which is the measure of the optimal 
solution, is then at least n/e^+^. 

We now evaluate the performance of the online algorithm. It then appears 
that a smaller Iq is harder for the algorithm to keep large bins. Unfortunately, 
a smaller Iq also makes proofs harder. The following setting, Iq = (logn)^, is 
somewhat conservative: 



Lemma 6 . Suppose that Iq = (logn)^. Then any online IS algorithm in the 
inheritance model with r(n) = cannot produce independent sets whose size is 
(logrz)^ or more in the worst case. 

Proof. We maintain the following invariant (*): 

There is no bin B containing only good vertices that has 
more than (logn)^ vertices from a single block. (*) 

When the first Iq = (logn)^ vertices are given, (*) is obviously true. Sup- 
pose now that (*) is true at some moment. It then also remains true when the 
adversary introduces in step ( 1 ) a new block of Iq vertices. Consider now the 
case when two blocks bi and 62 are merged. Let S be the set of good vertices 
in bi U 62 contained in a particular bin B. If [S'! < Iq the invariance remains 
true, so suppose otherwise. Recall that the adversary attacks each vertex in S 
with probability Therefore, the probability that all vertices in S survive 
the attacks is at most 



(1 _ ^i±l')(logn)" ^ ^ 

logn'^ ^(fc+l)loge- 



The probability that there is some bin that contains more than Iq vertices from 
61 U 62 is at most 



n 



fc+i 



^(fc+l) loge 






< 1 . 



That is, it holds with some positive probability that no bin includes (logn)^ 
or more good vertices in this combined block. This implies that there exists an 
adversary strategy (a choice of assignments to at most {k + 1) / log n- fraction of 
the vertices of 61 U 62) that maintains the invariant (*). 

It follows that no bin contains more than t ■ Iq < log^ n vertices. □ 
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Abstract. In this paper, we consider a two-dimensional version of the 
on-line bin packing problem, in which each rectangular item that should 
be packed into unit square bins is “rotatable” by 90 degrees. An on-line 
algorithm that uses an unbounded number of active bins is proposed. 
The worst case ratio of the algorithm is at least 2.25 and at most 2.565. 



1 Introduction 

The bin packing problem is one of the basic problems in the fields of theoretical 
computer science and combinatorial optimization. It has many important real- 
world applications, such as memory allocation and job scheduling, and is well- 
known to be NP-hard that is a main motivation of the study and development 

of approximation algorithms for solving the problem. 

The classical (one-dimensional) on-line bin packing problem is the problem 
of, given a list L of items (xi, X 2 , . . . , where Xi G (0,1], packing all items 
in L into a minimum number of unit-capacity bins. Note that term “on-line” 
implies that items in L are consecutively input, and the packing of an item must 
be determined before the arrival of the next item. In the literature, it is known 
that the worst case ratic0 of an optimal on-line bin packing algorithm OPT, 
denoted by is at least 1.5401 0 and at most 1.588 0 if the number of 

active bintpused by the algorithm is unbounded; and is exactly 1.69103 0 if it 
is bounded by some constant k. 

In this paper, we consider a two-dimensional version of the problem, in which 
each rectangular item that should be packed into unit square bins is “rotatable” 
by 90 degrees. The goal of the problem we consider is to pack all items in L 
into a minimum number of unit square bins in such a way that 1) each item is 
entirely contained inside its bin with all sides parallel to the sides of the bin, and 

2) no two items in a bin overlap. It should be worth noting that in the “normal” 
setting of higher dimensional bin packing problems, it is further requested that 

3) the orientation of any item is the same as the orientation of the bin, i.e., each 

^ A formal definition will be given in Section 2. 

^ We consider a class of algorithms in which each bin becomes active when it receives 
its first item; and once a bin is declared to be inactive (or closed), it can never 
become active again. 

D.-Z. Du et al. (Eds.): COCOON 2000, LNCS 1858, pp. 210- IT?ni 2000. 

(c) Springer- Verlag Berlin Heidelberg 2000 
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item must not be rotated; and under such a setting, it is known that an optimal 
on-line algorithm OPT fulfills 1.907 < R'gpj, < 2.85958 (= 1.691032) |TI^ . 

The two-dimensional on-line bin packing problem with rotatable items has 
several interesting applications, e.g., job scheduling on two-dimensional mesh 
computers and the fioor planning of VLSI layouts, while the rotation of items is 
prohibited in some of other applications, such as the assignment of newspaper 
articles to newspaper pages. A naive conjecture is that the worst case ratio 
could be improved by removing the condition on the rotatability since there is 
an instance in which the removal of the condition reduces an optimal number of 
bins to a half (e.g., consider a sequence of items of height | and width | each; 
by rotating a half of items, we can pack four such items into a bin, while each 
bin can accommodate at most two items without rotation). Another conflicting 
conjecture is that it is not very easy to improve the ratio, since the lower bound 
can also be improved by allowing rotations. In the following, we propose two 
algorithms Ai and A 2 such that 2.5624 < < 2.6112 and 2.25 < < 2.565, 

where the second algorithm is an extension of the first algorithm. 

The remainder of this paper is organized as follows. Section 2 introduces some 
basic definitions. The classifications of items and strips used in the proposed 
algorithms will also be given. Sections 3 and 4 describe the proposed algorithms. 
Section 5 concludes the paper with some future directions of research. 

2 Preliminaries 

2.1 Basic Definitions 

Let A be an on-line bin packing algorithm, and A{L) the number of bins used by 
algorithm A for input sequence L. The asymptotic worst case ratio (or simply, 
worst case ratio) of algorithm A, denoted as i?“, is defined as follows: 

lim sup 

n — ^00 

where R'^ max | I OPT{L) = n| . 

where OPT(L) denotes the number of bins used by an optimal bin packing 
problem provided that the input is L. An on-line bin packing problem is said to 
use /c-bounded-space (or, simply bounded) if, for each item, that is given in an 
on-line manner, the choice of bins into which it may be packed is restricted to 
a set of k or fewer active bins. If there is no such restriction, we say that the 
problem uses unbounded space (or, simply unbounded). 

2.2 Classification of Items 

Let T be the set of all rectangular items, denoted as {x,y), such that 0 < 
^ 2/ ^ 10 where x and y represent the height and width of the rectangle, 

® Note that we may assume x < y without loss of generality, since each item is 
rotatable by 90 degrees. 
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respectively. We first classify items in T into four subsets by the value of the 
cc-coordinate, as follows: Tq = {(a;,y)| | < a: < 1 }, Ti = {{x,y)\ | < a; < | }, 
T2 = {{x,y)\ I < a: < I }, and T3 = {{x,y)\ 0 < a; < | }. See Figure 0(a) for 
illustration. Subsets T2 and T3 are further partitioned into subsets by the value 
of the y-coordinate, as follows. 

Definition 1 (Partition of T3). Subset T3 is partitioned into four subsets 
To , To , Tf , To by the value of the u-coordinate, as follows: T 9 = {(x,y) G T3 1 | < 
y < 1 },T^ = {(x,y) G T3I i < 2/ < I }, T| = {{x,y) G T3I | < y < | }, and 
Ti = {{x,y) G T3I a; < 2/ < I }. 

In order to define a partition of T2, let us consider a sequence oq, ai, . . . of 
reals defined as follows: 

ao i and | + gx2i-i+3 for » = 1, 2, . . . . (1) 

For example, oi = | + 5 = |, 02 = | + = |, and 03 = ^ + ^ = ^. Note 

that I < Oi < i holds for any j > 0. Let e be a positive real that will be used 
as a parameter in the analysis of our algorithms, and (f) a function from R+ to 
Z defined as follows: 

m = riog2(2/(9 f)-i)i. (2) 

By using above definitions, the partition of subset T2 is described as follows. 

Definition 2 (Partition of T2). Given parameter e > 0, subset T2 is parti- 
tioned into three subsets T2 as follows: 



^2°=^ U {(2;,2/)| flj+1 < a; < Oj, 1 - Cj < 2/ < 1} 

t=o 

U{(a:,2/)| I < a; < 1 - < y < 1 } 

^2 U “i+1 < a: ^ “T 5 < y ^ “ “f} 

t=o 

U{(a:,2/)| 5 < a; < ^ <y < 1 - a,2.(£)} 

T^ = {{x,y)eT2\ I <y< i}. 

The overall partitioning of T into 9 subsets is also illustrated in Figured (a). 
Note that, for example, if the given item is (|, ^) then it belongs to subset T| 
since | | | and | < ^. As for the minimum size of items in each 

subset, we have the following two lemmas (proofs are omitted here). 

Lemma 1. If i > log 2 (^ — l) then | x (1 — a^) > | — |. 

Lemma 2 . 1 ) The size of items in Tq is greater than 2 ) the size of items in 
Ti is greater than and 3 ) the size of items in is greater than | — f . 
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2.3 Strips 

In the proposed algorithms, each bin is split into small strips, and a given item is 
packed into a strip of an appropriate size. Before describing the ways of splitting 
of each bin into strips, let us consider the following infinite set X of reals: 

^ I z = 3, 4, 5,7, 9,11, 13 and j = 0, 1, 2 , . . . }. 

By using X, two (infinite) sets of reals, Xiong and Xmed, are defined as follows: 



Xiong X U { i } 




(3) 


Xmed = X U { Oi, 02, . . 




(4) 



where Ui = ^ + g^2»-i-e3 for * > 1- la what follows, given 0 < a; < 1, let i denote 
the smallest element in Xiong that is greater than or equal to x, and let x denote 
the smallest element in Xmed that is greater than or equal to x. Note that for 
any x < ^, x/x {= x/x) > |. 

Three types of strips used in the proposed algorithms are defined as follows. 

Definition 3 (Strips). A subregion of a bin is said to be a long strip if it has 
a fixed width 1 and a height that is drawn from set Xiong. A subregion of a bin 
is said to be a short strip if it has a fixed width ^ and a height that is drawn 
from set Xiong. A subregion of a bin is said to be a medium strip if 1) it has 
a fixed width (resp. height) | and a height (resp. width) that is drawn from set 
X, or 2) it has a width (resp. height) 1 — Oi and a height (resp. width) Oj, for 
some 1 < f < 0(e). 



3 The First Algorithm 

3.1 Algorithm Description 

In this section we describe our first on-line algorithm. The algorithm is described 
in an event-driven manner; i.e., it describes the way of packing an input item 
{x, y) into an active bin of an appropriate size. 

Algorithm Ai 

If (x, y) G Tq\JTi, then open a new bin, put the item into the bin, and close the 
bin; otherwise (i.e., if (x,y) GT 2 U T3) execute the following operations: 

Case 1: if (x,y) G T° U Tg, then obtain an unused long strip of height x (as 
an active strip) by calling get_long_strip(a;), put the item into the returned 
strip, and close the strip. 

Case 2: if (x,y) G T 2 U Tg, then obtain an unused medium strip of height 
X (as an active strip) by calling get_medium_strip(a;), put the item into the 
returned strip, and close the strip. 

Case 3: if (x,y) G T| U T|, then obtain an unused short strip of height x (as 
an active strip) by calling get_short_strip(x), put the item into the returned 
strip, and close the strip. 
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Case 4: if (a;,?/) G Tg, then put the item into an active long strip of height x 
if such a strip with enough space is available. If there exist no such strips, 
after closing all active long strips of height x, obtain an unused long strip 
of height X by calling get_long_strip(x), and put the item into the returned 
strip. Note that the strip is not closed at this time. 

Figure □ (b) illustrates the correspondence of items with strips, in algorithm 
Ai . The performance of the algorithm, in terms of the occupation ratio of each 
closed strip, is estimated as follows (proofs of Lemmas 0 and El are omitted). 

Lemma 3 (Long strips). Any closed long strip of size S is occupied hy items 
with total size at least (| + e)^ if the height of the strip is and with size at 
most if the height is less than 

Proof. By Lemma El a closed long strip of height ^ is filled with items of size at 
least I — f ; i.e., the occupation ratio of the bin is at least | — e. On the other 
hand, by the same lemma, it is shown that a closed long strip of height i, for 
* > 3, is filled with items of total size more than | x that is at least i. 
Hence the lemma follows. 



Lemma 4 (Short strips). Any closed short strip of size S is occupied by an 
item with size at least if the height of the strip is and with size at most 
^ , if the height is less than | . 



Lemma 5 (Medium strip). 1) A medium strip with height Ui for 1 < z < 
4>{e), is filled with an item with size more than g (= | x 5 )/ o,nd 2) a medium 
strip with height for some i > 3, is filled with an item with size more than 

2 {i+l) T+T ^ 2)- 



3.2 How to Split a Bin into Strips ? 

Next, we consider the ways of splitting (unused) bins into strips. In the following, 
we give a formal description of three procedures, get_long_strip, get_medium_strip, 
and get_short_strip, that have been used in the proposed algorithm. 

procedure getJong_strip(a;) 

1. Recall that x denotes the smallest element in Xiong that is greater than or 

equal to x. If there is no unused long strip with height x2^ (< 1) for any 

j > 0, then execute the following operations: 

(a) Let j' be an integer such that x2^ < 1 < x2.^ Note that x2^ = 

1 1 1 1 1 J_ or i 

2’ 3’ 5’ 7’ 9’ 11 > ^ 13’ 

(b) Open an unused bin, and partition it into long strips with height 
x2^ each. 

2. Select one of the “lowest” strips, say Q, with height x2^ (< 1) for some j > 0. 
Let x2^ be the height of strip Q. 
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3. If i” = 0 (i.e., if the height of Q is x) then return Q as an active strip; 
otherwise, partition Q into j" + 1 strips of heights x2^ ~^,x2^ . . . ,x, x, 

respectively, and return a strip of height a; as an active one. 

procedure get_short_strip(a;) 

1. If there is an unused short strip of height x, then return it as an active one. 

2. Otherwise, obtain an unused long strip with height x by calling procedure 
get_long_strip(x), partition the obtained one into two unused short strips with 
height X each, and return one of them as an active one. 

procedure get_medium_strip(a;) 

1. Recall that x denotes the smallest element in Xmed that is greater than or 
equal to a;. If there is an unused horizontal medium strip with height x (or, 
vertical medium strip with width x), then return it as an active one. 

2. Otherwise, \i x > then open an unused bin, partition it into two horizontal 
medium strips of height x and two vertical medium strips of width x as in 
Figure H(a), and return one of the four strips as an active one. 

3. Otherwise, i.e., if a; < |, execute the following operations: 

(a) If there is no unused medium strip with height x2^ (< I) for any j > 0, 
then open an unused bin, partition it into horizontal medium strips 
of height x2^ and [ ^ J vertical medium strips of width x 2 ^ as in 

FigureEl (b), where j' is an integer such that x2^ < 1 < x2^ 

(b) Select one of the “lowest” medium strips, say Q, with height x2^ (< 1) 
for some j > 0- Let x2^ be the height of strip Q. 

(c) If j” = 0 (i.e., if the height of Q is x) then return Q as an active 
strip; otherwise, partition Q into j” + I strips of heights x2^ ~^,x2^ 

. . . ,x,x, respectively, and return a strip of height x as an active one. 

By Lemma0and get_medium_strip, we immediately have the following lemma. 

Lemma 6. Any closed bin that is split into medium strips is filled with items 
with total size at least 



3.3 Analysis 

Theorem 1. < 2.6112 + e for any e > 0. 

Proof. Let L be a sequence of n items, consisting of no items in Tq, n\ items in 
Ti, where no + n\ < n. Note that algorithm A\ requires bo (= no) bins for items 
in To and b\ (= ni) bins for items in T\. Suppose that it also uses 62 (> 0) bins 
split into long strips of height 1 / 2 , and 63 (> 0 ) bins split into medium strips of 
height more than 1/3. Let S be the total size of items in L, and S '4 the total size 
of items that are not packed into the above no + ni + b 2 + &3 bins; i.e., items of 
height at most 1/3. 

If e is a constant, algorithm Ai uses at most no + ni + b 2 + b^ + 2 S 4 + 0(1) 
bins and OPT requires at least maxjno -I- maxjni, 62 }, <S'} bins. Hence — 
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{no + ni + 62 + ^3 + 25 ' 4 }/{max{no + maxjni, 62}, S'}}. Since S4 < S — |no — 
(| — e)b2 — §63 — ^ hy Lemmas 0 | El and 0 we have 

^00 ^ 2S + inp + + (^ + e')&2 - 1^3 

~ maxjno + maxjni, 62}, S} 

When no + max{m,62} < S, < 2 + (;|){§no + (| + e')^2 + |«i}- If 
m < 62, i?Xi < 2 + (l/S)|fno + (^ + £^^2} < 2 + + e', and if m > 62, 

-STi ^ 2 + (l/S){gno + + e')ni} < 2 + ^ + e'. On the other hand, when 

no + max{m,62} > S, it is at most < 2 + jlno + (5 + e')^2 + |«i}/|no + 
max{m,62}}- If ni < 62, S 2 + jlnp + (^ + e')^2}/(no + 62) < 2 + ^ + e', 
and if ni > 62, < 2 + jfno + (^ + e')’^i}/(£*o + ni) < 2 + ^ + e'. Hence 

the theorem follows. 



Theorem 2 . > 2.5624. 



Proof. Consider the following eight items 

A:(i+5,i + 5) H:(i + 5,|-(5) C:(| + (5,i + 5) D:{^ + S,^ + S) 



where <5 is a sufficiently small constant such that item B is in T° . 

Let L be a sequence of items consisting of n copies of items A, C, F, H, and 
2n copies of items B,D,E,G. Note that since ^ = ^ = \ ~ by 

selecting 5 to be sufficiently small, we can pack one copy of A, C, F, F[ and two 
copies of H, D, E, G, as in FigureOI Hence OPT{L) = n. 

On the other hand, in algorithm Hi, 1) n copies of item A require n bins; 
2) 2n copies of item B require n bins, since B is classified into T2 ; 3) n copies 
of item C require j bins, since G is classified into Tf] 4) 2n copies of item D 
require bins; 5) 2n copies of item E require ^ bins; 6) n copies of item F 
require ^ bins; 7) 2n copies of item G require ^ bins; and 8) n copies of item 
H require bins. Hence, in total, it requires at least 



n 2n 
n + n+- + - 



2n 

16 



n 

Too 



2n 



n 

144 



> 2.5624n 



bins. Hence the theorem follows. 



4 The Second Algorithm 

In this section, we propose an extension of the first algorithm. The basic idea 
behind the extension is to pack an item in Ti and item(s) in T3 in the same bin, 
as much as possible. Recall that under the “on-line” setting, we cannot use any 
knowledge about the input sequence L in the algorithm. 
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4.1 Algorithm Description 

In our second algorithm, called Algorithm A 2 , when an item in Ti is packed into 
a new bin, the bin is split into two subregions; a square region of size | x | 
that will be filled with the item in T\ , and the remaining “L-shaped” region that 
is reserved for possible usage by items in T3 (note that the reserved L-shaped 
region will never be used, in the worst case). On the other hand, when an item 
in Ts is packed into a new bin, under a certain condition described below, the 
bin is partitioned into two subregions, and the square region of size | x | is 
reserved for possible usage by items in T\ (note again that the reserved square 
region will never be used, in the worst case). 

In order to balance the advantage and disadvantage of the above scenario, 
we partition 20% of bins devoted to items in for the above use; the remaining 
80% of bins are used in the same manner to Algorithm A\. A formal description 
of the algorithm is given as follows. 

Algorithm A 2 

Case 1: If {x,y) G Tq U T 2 , then it acts as in Algorithm Ai. 

Case 2: If {x, y) G Ti, then put the item into an active bin if there is an active 
bin that has been split into two subregions, and the small square region has 
not accommodated any item; if there is no such bin, then open a new bin, 
split it into two subregions, and pack the item into the small square region. 
Case 3: If (x,y) G T3, then it obtains a strip of height x (= x) by calling 
procedures described in Section 3.2 with a minor modification such that an 
unused L-shaped region is a possible candidate for the splitting into strips, 
and pack the item into a strip to satisfy the following conditions: 80% of bins 
dedicated to items in T 3 are filled with closed strips; and 20% of such bins 
are filled with strips in such a way that there remains an available square 
region of size | x | . 

4.2 Analysis 

Theorem 3. < 2.565 -I- e for any e > 0. 

Proof. Let L be a sequence of n items. In this proof, we use the same symbols 
as in the proof of Theorem^ e.g., no and ni denote the numbers of items in Tq 
and Ti, respectively. 

Recall that algorithm Ai uses at most no -I- ni -I- 62 + ^3 + 2 S 4 + 0(1) bins and 
an optimal algorithm OPT requires at least max{no -I- max{ni, 62}, S} bins. 

If ni < then since all items in Ti are packed into bins that is split into 
small square region and an L-shaped region, the “average” occupation ratio of 
every closed bin can be increased from at least | to at least | (note that the 
average occupation ratio of bins with strips of height at most on the other 
hand, reduces from f to |); i.e., the worst case ratio is bounded by | = 2.5. 
Hence, in the following, we assume that rii > ^. 

Ifni > ^7 we can imaginally suppose that the number of bins split into strips 
of height at most | each reduces from 2 S 4 to aS'4 for some constant a < 2, by 



218 Satoshi Fujita and Takeshi Hada 



using the splitting method illustrated in Figure 5. If the number of (imaginal) 
bins used by items in is aS' 4 , then by using a similar argument to Theorem Q 
we have 

(note that if a = 2, we have the same bound with Theorem P). The number of 
(imaginal) bins reduces from 2^4 to at most || if all items are packed into strips 
of height i; at most (< ||) if all items are packed into strips of height and 
at most ^ (< y|) if all items are packed into strips of height at most Hence 
we have a < for any sequence L; i.e., 



R 



00 

A2 



< 



„ 11 

2 H X 

36 



24 

13 




< 2.565. 



Hence the theorem follows. 



Finally, by using a similar technique to Theorem |21 we have the following 
theorem (we may simply consider a sequence consisting of items H, B and C) . 

Theorem 4. > 2.25. 



5 Concluding Remarks 

In this paper, we proposed two on-line algorithms for solving the two-dimensional 
bin packing problem with rotatable rectangular items. 

The proposal of a nontrivial lower bound is an important open problem 
to be solved (currently best lower bounds on the worst case ratio of an optimal 
algorithm for “rotatable” setting are the same as the one-dimensional case; i.e., it 
is at least 1.5401 for unbounded algorithms 0, and at least 1.69103 for bounded 
algorithms jSj). An extension to higher dimensional cases (e.g., three-dimensional 
box packing) would also be promising. 
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packed into long strips 





(a) Boundaries for the classification of (b) Correspondence of items with strips, 
items by the values of x- and ^-coordinates. 

Fig. 1. Classification of items. 




Fig. 2. Partition of a bin into medium strips. 
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Fig. 3. A bad instance to give a lower bound. 
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